In a multi-location television conference system
that connects five locations A, B, C, D, and E, when
speeches take place at the four locations A, B, C,
and D at the same time, at a listening location E,
images of all the speaking locations A, B, C, and D
are displayed on one screen with four divided screen
areas. On the other hand, at the speaking location A,
images of the speaking locations B, C, and D and an
image of the former speaking location E are displayed
on one screen with four divided screen areas.
In addition, when images of speaking locations are
displayed, location names thereof are also displayed.
Thus, a television conference held at a plurality of
locations at a time can be managed as smoothly as
a real conventional conference.

The present invention relates to a television
conference system for use in a multi-location
television conference (also called videoconferencing).

In recent years, with the advent of ISDN (In- 
tegrated Services Digital Networks), digital net- 
works have been widely used and television con- 
ference systems have become popular. 

The television conference systems transmit im- 
age data and sound data among remote locations 
so as to hold conferences. Conventional television 
conference systems transmit image data and 
sound data between two locations (namely, point- 
to-point connection). Recently, to more effectively 
hold conferences, a multi-location television con- 
ference system as shown in Fig. 1 has gained 
public attention. The needs of multi-location televi- 
sion conference systems result from geographical 
distribution of offices (such as main office, works, 
laboratories, sales offices) of companies. In this 
situation, so as to reduce cost and time necessary 
for transmitting information, people, and materials, 
television conferences have been held among tele- 
vision conference terminal units disposed at remote 
locations. 

Fig. 1 is a schematic diagram showing an 
overall construction of a multi-location television 
conference system that holds a television con- 
ference among conference rooms A, B, C, and D 
disposed at four locations. 

In the drawing, reference numerals 1 to 4 refer 
to television conference terminal units disposed at 
the four conference rooms. Reference numeral 10 
refers to a multi-location control apparatus dis- 
posed at a center station. 

The television conference terminal units (that 
may be hereinafter referred to as terminal units) 1 
to 4 each have a camera, a monitor, a microphone, 
an encoding unit, and so forth. Thus, the television 
conference terminal units each have input and out- 
put functions of image data and sound data. Image 
data and sound data of each conference room are 
sent to the center unit 10. The center unit 10 sends 
the image data and sound data to each of the 
television conference terminal units. 

The center unit 10 receives image data and 
sound data from all the conference rooms, mixes 
all the sound data, sends the mixed sound data to 
the television conference terminal units 1 to 4, and 
distributes the image data to all the television con- 
ference terminal units. 

In this multi-location television conference sys- 
tem, a monitor of the television conference terminal 
unit at each conference room (location) switches 
and displays images of other conference rooms (of 
at least one location). When a speaking location is 
detected, the image thereof is automatically dis- 
played. Thus, the monitor of each conference room 
displays at least an image of a speaking location. 


(However, the monitor of a speaking location dis- 
plays an image of a former speaking location.) 

A switching and displaying type multi-location
control apparatus that distributes an image of one
conference room to television conference terminal
units of other conference rooms is known. This
multi-location control apparatus can be easily
introduced because the television conference terminal
units for use with the point-to-point connection type
television conference system can be used as they
are. Thus, the switching and displaying type
multi-location control apparatus has been widely
developed and fabricated.

Moreover, a multi-location control apparatus
having a new station (NS)/past station (PS)
designating mode is known.

Conventionally, an image switching operation in
the NS/PS designating mode is automatically
performed when the television conference terminal unit
at each location is operated. Alternatively, this
operation is automatically performed when the
multi-location control apparatus detects a speaking
attendant. This function is referred to as the speaking
attendant detecting function. With this function, the
multi-location control apparatus distributes image
data of the speaking location (NS) to the television
conference terminal units at other locations. In
addition, the multi-location control apparatus
distributes the image data of the former speaking location
(PS) to the television conference terminal unit at
the speaking location (NS).

Fig. 2 is a schematic diagram showing an
example of the operation of the NS/PS designating
mode. In this drawing, a multi-location control
apparatus 20 is connected to television conference
terminal units 12A, 12B, 12C, 12D, and 12E
disposed at five locations of conference rooms A, B,
C, D, and E through a network.

In this television conference system, when an
attendant speaks at the conference room A or an
image switching and selecting operation for an
image of the conference room A is performed at
another conference room, the multi-location control
apparatus 20 designates the conference room A to
the NS mode and sends the image data of the
conference room A to the television conference
terminal units at other conference rooms B, C, D,
and E (see a solid arrow of Fig. 2). In addition, the
multi-location control apparatus 20 sends the image
data of the conference room D that has been
designated to the PS mode to the conference room
A that has been designated to the NS mode (see a
dotted arrow of Fig. 2).

Fig. 3 is a block diagram showing an overall
construction of a multi-location television conference
system having a multi-location control apparatus
110 that automatically designates the conference
rooms A, B, ..., and E to the NS/PS mode
depending on detecting a speaking attendant and
switches and controls image data to be distributed
to the television conference terminal units 120A,
120B, ..., and 120E.

As shown in Fig. 3, each of the television
conference terminal units 120A to 120E comprises
a plurality of microphones 121, a microphone mixer
122, a speaker 123, a sound control unit 124, a
sound encoding and decoding unit (hereinafter
referred to as sound CODEC) 125, a camera 126, a
monitor 127, an image CODEC 128, and a
multiplexing and demultiplexing unit (MUX/DMUX) 129.
The microphones 121 are directed to attendants in 
a conference room. The microphone mixer 122 
mixes outputs of the microphones 121. The speak- 
er 123 outputs a sound of another television con- 
ference terminal unit 120. The sound control unit 
124 performs an input/output sound switching op- 
eration, an echo removing operation, or the like so 
as to prevent the output sound of the speaker 123 
from entering the microphones 121. The sound
CODEC 125 encodes sound data received from the
sound control unit 124 and decodes sound data
sent to the sound control unit 124. The camera 126
photographs the entire scene of the conference 
room, attendants, and/or materials. The monitor 
127 displays images of other conference rooms 
received from other television conference terminal 
units 120. The image CODEC 128 encodes image 
data photographed by the camera 126 and de- 
codes encoded data to an image to be displayed 
on the monitor 127. The MUX/DMUX unit 129 mul- 
tiplexes and demultiplexes an input signal and an 
output signal of the sound CODEC 125 and the 
image CODEC 128. 

The television conference terminal units 120A
to 120E with the above-mentioned construction are
connected to the multi-location control apparatus 
110. 

The multi-location control apparatus 110 com- 
prises a multiplexing and demultiplexing 
(MUX/DMUX) unit 111, a sound CODEC 112, a 
sound level comparator 113, a mixer 114, an image 
selecting unit 115, and a control unit 116. The 
MUX/DMUX unit 111 multiplexes and demultiplexes 
sound data and image data received from and sent 
to each of the television conference terminal units 
120A to 120E. The sound CODEC 112 is con- 
nected to the MUX/DMUX unit 111. The sound 
level comparator 113 compares levels of sound 
signals received from the sound CODECs 112 of 
the television conference terminal units 120A to 
120E. The mixer 114 mixes the sound signals
compared by the comparator 113 and sends the
mixed sound signal to the speaker 123 of each of
the television conference terminal units 120A to
120E through the sound CODEC 112 and the
MUX/DMUX unit 111. The image selecting unit 140
selects image data demultiplexed by the
MUX/DMUX unit 111 and sends the demultiplexed
image data to the monitor 127 of each of the
television conference terminal units 120A to 120E.
The control unit 116 sends a selecting signal to the
image selecting unit 140 corresponding to the
compared result of the sound level comparator 113.

In this related art reference, the levels of the
microphones 121 of the television conference
terminal units 120A to 120E are compared by the
sound level comparator 113 of the multi-location
control apparatus 110. The multi-location control
apparatus 110 designates the television conference
terminal unit that outputs the maximum sound level
to the NS mode. The image selecting unit 140
selects image data received from the television
conference terminal unit with the maximum sound
level and distributes this image data to other
television conference terminal units.
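
The comparison performed by the sound level comparator 113 can be sketched as follows. This is a minimal illustration, not the circuit of the related art: the function name `detect_new_station`, the representation of levels as floating-point numbers keyed by room name, and the `threshold` parameter (a silence floor) are all assumptions introduced for the example.

```python
from typing import Dict, Optional


def detect_new_station(levels: Dict[str, float],
                       threshold: float = 0.0) -> Optional[str]:
    """Return the room with the maximum sound level, to be designated NS.

    `levels` maps a room name to its current sound level. `threshold` is a
    hypothetical silence floor: if no level exceeds it, no room is chosen.
    """
    if not levels:
        return None
    loudest = max(levels, key=levels.get)
    return loudest if levels[loudest] > threshold else None
```

For instance, with levels of 0.8 for room A and 0.2 for room B, the sketch selects room A as the new station.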

Fig. 4 is a block diagram showing an inner
construction of the image selecting unit 140.

Image data of the conference rooms A, B, C,
..., and X at n locations are input to the input
terminal 141. The image data are sent to an NS
selecting unit 142 and a PS selecting unit 143, each
of which selects one from n alternatives. The NS
selecting unit 142 and the PS selecting unit 143
receive a selecting signal from the control unit 116
and send image data of the conference rooms that
have been designated to the NS and PS modes to
the n NS/PS selecting units 144A, 144B, 144C, ...,
and 144X.

The output terminals of the NS/PS selecting
units 144A, 144B, 144C, ..., and 144X are
connected to respective image data output terminals
145 of the conference rooms A, B, C, ..., and X so
as to select and output image data of the
conference rooms that have been designated to the
NS mode or PS mode.

In the image selecting unit 140 with the
above-mentioned construction, when the conference room
A is designated to the NS mode and the conference
room E is designated to the PS mode, the
control unit 116 outputs a selecting signal of the
image data of the conference room A to the NS
selecting unit 142 and a selecting signal of the
image data of the conference room E to the PS
selecting unit 143. The NS selecting unit 142 and
the PS selecting unit 143 select and output image
data received from the conference rooms A and E,
respectively, to the NS/PS selecting units 144A,
144B, ..., and 144X. The NS/PS selecting unit 144A
selects and outputs the image data of the
conference room E, which has been designated to the
PS mode. The other NS/PS selecting units 144B,
144C, ..., and 144X receive image data of the
conference room A, which has been designated to
the NS mode, from the control unit 116 and select
and output the image data. Thus, the image of the
conference room A, which has been designated to
the NS mode, is displayed on the respective
monitors at the conference rooms B, C, ..., and X,
which have not been designated to the NS mode.
On the other hand, the image of the conference
room E, which has been designated to the PS mode,
is displayed on the monitor at the conference room
A, which has been designated to the NS mode.
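
The net effect of the NS selecting unit 142, the PS selecting unit 143, and the per-room NS/PS selecting units 144 can be summarized as a routing table. The sketch below is only an illustration of that behavior; the function name `route_images` and the use of room names as strings are assumptions, not part of the related art apparatus.

```python
from typing import Dict, List


def route_images(rooms: List[str], ns: str, ps: str) -> Dict[str, str]:
    """Map each destination room to the room whose image it displays.

    The room designated NS receives the image of the PS room; every other
    room (including the PS room itself) receives the image of the NS room.
    """
    if ns not in rooms or ps not in rooms:
        raise ValueError("NS and PS must both be connected rooms")
    if ns == ps:
        raise ValueError("NS and PS must be different rooms")
    return {room: (ps if room == ns else ns) for room in rooms}


# Example from the text: room A is NS and room E is PS.
routing = route_images(["A", "B", "C", "D", "E"], ns="A", ps="E")
```

With this input, `routing["A"]` is `"E"` and every other entry is `"A"`, matching the display described above.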

In the first related art reference, image data is 
displayed in a fixed display format regardless of 
the number of speaking locations. In other words, 
the number of images of speaking locations dis- 
played on a monitor at each conference room does 
not vary. Thus, when the system displays an image 
of one location, even if there are a plurality of 
speaking locations, only an image of a main speak- 
ing location is displayed. On the other hand, when 
the system displays a plurality of locations at a 
time, even if there is one speaking location, divided 
screen images of listening locations are also dis- 
played on the monitor. 

Thus, a listening attendant of a conference
needs time to work out who is speaking. In this
situation, the realism of the conference suffers.

A first object of the present invention is to
provide a display system that allows each attendant
of a television conference to easily and
realistically identify the speaking locations.

In the related art multi-location television
conference system having the conventional NS/PS
designating mode, when an attendant in a conference
room that has not been designated to the NS mode
selects a conference room, the selected conference
room is designated to the NS mode, and the
monitor images of the other conference rooms are
therefore switched as well. On the other hand, in the
system that automatically designates the NS mode
when detecting a speaking attendant, whenever a
speaking location (conference room) is changed, the
monitor image data is automatically switched
regardless of the intention of listening attendants.
Thus, these attendants cannot continue to view
images of conference rooms other than the speaking
locations.

Thus, in the conventional multi-location control
apparatus with the NS/PS designating mode,
attendants of other conference rooms cannot select
images of conference rooms they want to see.
When an attendant of another conference room
changes the mode or speaks, the image thereof is
automatically displayed on the monitor screens of
other conference rooms. This automatic switching
of images of conference rooms adversely affects
the progress of the conference.

A second object of the present invention is to
hold a television conference at multiple locations
with realism similar to that of a real conference.


Since all related art multi-location television
conference systems display only image data of a
conference room, attendants of the conference
cannot easily know who is speaking at which
conference room. Thus, the conference cannot
proceed smoothly.

A third object of the present invention is to
allow each attendant to easily know background
information (such as conference room name,
location name, name and title of speaking attendant,
and the like) with respect to image data of a
conference room displayed on the monitor screen.

A first aspect of the present invention is a
multi-location control apparatus for use with a
multi-location television conference system that has
a plurality of multi-location television conference
terminal units disposed at three or more locations,
each of said television conference terminal units
being adapted for transmitting and receiving sound
data and image data, said multi-location control
apparatus being adapted for receiving sound data
and image data from all of said television
conference terminal units and for transmitting sound
data and image data to said television conference
terminal units, said multi-location control apparatus
comprising speaking location detecting unit for
detecting a speaking location corresponding to sound
signals that are output from sound input unit of
each of said television conference terminal units,
image combining unit for generating image data
containing images of speaking locations when a
plurality of speaking locations are detected by said
speaking location detecting unit, and image
distributing unit for transmitting image data generated by
said image combining unit to each of said
television conference terminal units.

A second aspect of the present invention is a
multi-location control apparatus for use with a
multi-location television conference system that has
a plurality of multi-location television conference
terminal units disposed at three or more locations,
each of said television conference terminal units
being adapted for transmitting and receiving sound
data and image data, said multi-location control
apparatus being adapted for switching and
controlling sound data and image data transmitted among
all of said television conference terminal units, said
multi-location control apparatus comprising
switching and outputting unit for inputting image data of
said conference rooms from all of said television
conference terminal units and for switching and
outputting image data of any conference room to
each of said television conference terminal units
independent from other television conference
terminal units, and control unit for controlling said
switching and outputting unit so that when a
switching request for displaying an image of a desired
conference room is issued from a particular
television conference terminal unit, image data to be
transmitted to this particular television conference
terminal unit is switched to the desired image data
by the particular television conference terminal unit.

A third aspect of the present invention is a 
multi-location control apparatus for use with a 
multi-location television conference system that has 
a plurality of multi-location television conference 
terminal units disposed at three or more locations, 
each of said television conference terminal units 
being adapted for transmitting and receiving sound 
data and image data, said multi-location control 
apparatus being adapted for switching and control- 
ling sound data and image data transmitted among 
all of said television conference terminal units, said 
multi-location control apparatus comprising mode 
designating unit having a first mode and a second 
mode, said first mode being adapted for selecting 
image data of a conference room displayed at a 
television conference terminal unit independent 
from other television conference terminal units, said 
second mode being adapted for forcedly selecting 
image data of a conference room displayed at a 
television conference terminal unit corresponding to 
an occurrence of a predetermined event of another 
television conference terminal unit, mode designat- 
ing unit for designating either said first mode or 
said second mode to each of said conference 
rooms corresponding to a mode designating
request received from a television conference
terminal unit of each of said conference rooms, switching
and outputting unit for inputting image data of said
conference rooms from all of said television con- 
ference terminal units and for switching and output- 
ting image data of any conference room to each of 
said television conference terminal units indepen- 
dent from other television conference terminal 
units, first control unit for controlling said switching 
and outputting unit so that when a switching re- 
quest for displaying an image of a desired con- 
ference room is issued from a particular television 
conference terminal unit that has been designated 
to said first mode, image data to be transmitted to 
this particular television conference terminal unit is 
switched to the desired image data by the particu- 
lar television conference terminal unit, and second 
control unit for controlling said switching and out- 
putting unit so that when said event takes place in 
a particular television conference terminal unit, im- 
age data to be transmitted to television conference 
terminal units that have been designated to said 
second mode is switched to image data of the 
television conference terminal unit. 
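
The interplay of the first mode (independent selection) and the second mode (forced selection on an event) can be sketched as follows. This is an illustrative model under stated assumptions, not the claimed apparatus: the names `Mode`, `Switcher`, `request_switch`, and `on_event` are hypothetical, the default of placing every room in the second mode is an assumption, and the "predetermined event" is modeled simply as a call naming the room where it occurred.

```python
from enum import Enum
from typing import Dict, List


class Mode(Enum):
    INDEPENDENT = 1  # first mode: the terminal selects its own image
    FORCED = 2       # second mode: an event elsewhere selects the image


class Switcher:
    """Toy model of the switching and outputting unit and its two controls."""

    def __init__(self, rooms: List[str]) -> None:
        self.rooms = list(rooms)
        # Assumption: every room starts in the second (forced) mode.
        self.mode: Dict[str, Mode] = {r: Mode.FORCED for r in self.rooms}
        self.shown: Dict[str, str] = {}  # destination room -> source room

    def set_mode(self, room: str, mode: Mode) -> None:
        """Mode designating unit: apply a mode designating request."""
        self.mode[room] = mode

    def request_switch(self, room: str, desired: str) -> bool:
        """First control: honor a request only from a first-mode room."""
        if self.mode[room] is not Mode.INDEPENDENT or desired not in self.rooms:
            return False
        self.shown[room] = desired
        return True

    def on_event(self, source: str) -> None:
        """Second control: switch all second-mode rooms to the event source."""
        for room in self.rooms:
            if room != source and self.mode[room] is Mode.FORCED:
                self.shown[room] = source
```

In this model, a room that has chosen the first mode keeps its selected image when an event occurs at another room, while rooms in the second mode are switched to the room where the event occurred.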

A fourth aspect of the present invention is a
multi-location control apparatus for use with a
multi-location television conference system that has
a plurality of multi-location television conference
terminal units disposed at three or more locations,
each of said television conference terminal units
being adapted for transmitting and receiving sound
data and image data, said multi-location control
apparatus being adapted for switching and
controlling sound data and image data transmitted among
all of said television conference terminal units, said
multi-location control apparatus comprising code
generating unit for generating a background
information code corresponding to a background
information number according to an image of a
conference room that has been encoded and received
from a television conference terminal unit of the
conference room that has been switched and
selected, and multiplexing and distributing unit for
multiplexing the encoded data of the image signal
of the conference room received from the television
conference terminal unit of the conference room
that has been switched and selected and the code
generated by said code generating unit and for
distributing the multiplexed signal to all the
television conference terminal units.
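
The idea of multiplexing a background information code with the encoded image data can be illustrated with a simple length-prefixed frame. The frame layout below (a 2-byte code, a 4-byte payload length, then the payload, all big-endian) is purely hypothetical and chosen for clarity; the source does not specify the multiplexing format, and the function names are likewise assumptions.

```python
import struct
from typing import Tuple

# Hypothetical header: unsigned 16-bit code + unsigned 32-bit payload length.
_HEADER = ">HI"
_HEADER_SIZE = struct.calcsize(_HEADER)


def multiplex(info_code: int, encoded_image: bytes) -> bytes:
    """Prepend the background information code to the encoded image data."""
    return struct.pack(_HEADER, info_code, len(encoded_image)) + encoded_image


def demultiplex(frame: bytes) -> Tuple[int, bytes]:
    """Recover the code and the encoded image data at a terminal unit."""
    info_code, length = struct.unpack(_HEADER, frame[:_HEADER_SIZE])
    payload = frame[_HEADER_SIZE:_HEADER_SIZE + length]
    if len(payload) != length:
        raise ValueError("truncated frame")
    return info_code, payload
```

Because the encoded image data passes through unchanged, a sketch of this kind does not require the control apparatus to decode the image, in contrast to the superimposing approach of the fifth aspect.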

A fifth aspect of the present invention is a
multi-location control apparatus for use with a
multi-location television conference system that has
a plurality of multi-location television conference
terminal units disposed at three or more locations,
each of said television conference terminal units
being adapted for transmitting and receiving sound
data and image data, said multi-location control
apparatus being adapted for switching and
controlling sound data and image data transmitted among
all of said television conference terminal units, said
multi-location control apparatus comprising image
generating unit for generating a background
information code corresponding to a background
information number according to an image of a
conference room that has been encoded and received
from a television conference terminal unit of the
conference room that has been switched and
selected, image decoding unit for decoding the
encoded data of the image signal of the conference
room received from the television conference
terminal unit of the conference room that has been
switched and selected to an image signal, image
superimposing unit for superimposing the image
signal generated by said image generating unit
over the image signal decoded by said image
decoding unit, and encoding unit for encoding the
image signal generated by said image
superimposing unit.

One skilled in the art can easily understand
additional features and objects of this invention
from the description of the preferred embodiments
and some of the attached drawings. In the
drawings:

Fig. 1 is a schematic diagram showing overall
construction of a multi-location television
conference system;
Fig. 2 is a schematic diagram for explaining the
operation of an NS/PS designating mode;
Fig. 3 is a block diagram showing a construction
of a conventional multi-location television
conference system;
Fig. 4 is a block diagram showing an inner
construction of a multi-location control apparatus
having a conventional NS/PS designating mode;
Fig. 5 is a block diagram showing a principle of
construction of the present invention (No. 1);
Fig. 6 is a block diagram showing a principle of
construction of the present invention (No. 2);
Fig. 7 is a block diagram showing a principle of
construction of the present invention (No. 3);
Fig. 8 is a block diagram showing a principle of
construction of the present invention (No. 4);
Fig. 9 is a block diagram showing a principle of
construction of the present invention (No. 5);
Fig. 10 is a block diagram showing a principle of
construction of the present invention (No. 6);
Fig. 11 is a block diagram showing a principle of
construction of the present invention (No. 7);
Fig. 12 is a block diagram showing overall
construction of a multi-location television conference
system according to a first embodiment of the
present invention;
Fig. 13 is a block diagram showing detailed
construction of the multi-location conference
system according to the first embodiment;
Fig. 14 is a block diagram for explaining a
principle of operation of the first embodiment;
Fig. 15 is a schematic diagram for explaining
display method in the case that speeches take
place at four locations at the same time;
Fig. 16 is a schematic diagram for explaining
display method in the case that speeches take
place at two locations at the same time;
Fig. 17 is a schematic diagram for explaining
display method in the case that a speech takes
place at one location;
Fig. 18 is a schematic diagram showing an
example of transmitting method of an image
signal from each television conference terminal
unit to the multi-location control apparatus;
Fig. 19 is a schematic diagram showing an
example of generating method of an image
signal displayed on four divided areas of one
screen;
Fig. 20 is a flow chart for explaining overall
operation of the first embodiment;
Fig. 21 is a flow chart for explaining operation in
the case that speeches take place at a plurality
of locations at the same time;
Fig. 22 is a flow chart for explaining operation in
the case that a speech takes place at one
location;

Fig. 23 is a block diagram showing overall con- 
struction of a multi-location television conference 


system according to a second embodiment of 
the present invention; 

Fig. 24 is a schematic diagram showing detailed 
construction of an image selecting unit; 
Fig. 25 is a schematic diagram for explaining 
outlined operation of the second embodiment; 
Fig. 26 is a schematic diagram showing inner 
construction of a mode management table; 
Fig. 27 is a schematic diagram showing inner 
construction of an NS/PS management table; 
Fig. 28 is a schematic diagram showing inner 
construction of a displaying conference room 
management table; 

Fig. 29 is a schematic diagram showing inner 
construction of a matrix switcher status table; 
Fig. 30 is a flow chart showing operation cor- 
responding to an occurrence of a speaking at- 
tendant detecting event; 

Fig. 31 is a schematic diagram for explaining 
table rewrite operation corresponding to a detec- 
tion of a speaking attendant; 
Fig. 32 is a flow chart of displaying conference 
room switching operation; 
Fig. 33 is a schematic diagram for explaining 
table rewrite operation corresponding to a dis- 
playing conference room switching operation 
(No. 1); 

Fig. 34 is a schematic diagram for explaining
table rewrite operation corresponding to a dis- 
playing conference room switching operation 
(No. 2); 

Fig. 35 is a schematic diagram for explaining 
table rewrite operation corresponding to a dis- 
playing conference room switching operation 
(No. 3); 

Fig. 36 is a flow chart of designated mode 
changing operation; 

Fig. 37 is a schematic diagram for explaining
table rewrite operation corresponding to des- 
ignated mode changing operation (No. 1); 
Fig. 38 is a schematic diagram for explaining
table rewrite operation corresponding to
designated mode changing operation (No. 2);
Fig. 39 is a flow chart of automatic conference 
room switching enable/disable designating op- 
eration; 

Fig. 40 is a flow chart of remote location camera 
control operation; 

Fig. 41 is a flow chart of designated mode 
forced changing operation; 
Fig. 42 is a schematic diagram showing overall 
construction of a multi-location television con- 
ference system according to a third embodiment 
of the present invention; 

Fig. 43 is a schematic diagram showing an 
example of a screen display according to the 
third embodiment; 


EP 0 619 679 A1


Fig. 44 is a block diagram showing an example 
of a construction of a first system according to 
the third embodiment; 

Fig. 45 is a block diagram showing an example 
of a construction of a second system according 
to the third embodiment; 
Fig. 46 is a block diagram showing an example 
of a construction of a third system according to 
the third embodiment; and 
Fig. 47 is a block diagram showing an example 
of a construction of a fourth system according to 
the third embodiment. 

Fig. 5 is a block diagram showing a principle of
construction according to a first aspect of the
present invention.

In this drawing, reference numerals 161-1,
161-2, 161-3, ..., and 161-N are television conference
terminal units that receive and transmit sound data and
image data. The first aspect of the present
invention is a multi-location control apparatus for use
with a multi-location television conference system
that has a plurality of multi-location television
conference terminal units 161-i (where i = 1, 2, ...,
and N) disposed at three or more locations, each of
the television conference terminal units being
adapted for transmitting and receiving sound data
and image data, the multi-location control
apparatus being adapted for receiving sound data and
image data from all of the television conference
terminal units 161-i and for transmitting sound data
and image data to the television conference
terminal units 161-i.

The multi-location control apparatus 170 com- 
prises speaking location detecting unit 171 for de- 
tecting a speaking location corresponding to sound 
signals that are output from sound input units of 
each of the television conference terminal units 
161-i, image combining unit 172 for generating 
image data containing images of speaking locations 
when a plurality of speaking locations are detected 
by the speaking location detecting unit 171, and 
image distributing unit 173 for transmitting image 
data generated by the image combining unit 172 to 
each of the television conference terminal units 
161-i. 

In the multi-location control apparatus, the
speaking location detecting unit 171 detects
occurrences of speaking locations with sound input
signals received from the television conference
terminal units 161-1, 161-2, ..., and 161-N. The image
combining unit 172 divides and combines image
signals of the speaking locations so as to generate
an image signal that allows images of speaking
locations to be displayed on divided screen areas
of a monitor at a time. The image distributing unit
173 distributes the image signal to each of the
television conference terminal units 161-i (where i
= 1, 2, ..., and N). Thus, when there are a plurality
of speaking locations, images of the speaking
locations are displayed on the monitor screen of each of
the television conference terminal units 161-i.
Corresponding to the number of speaking locations,
the number of images of speaking locations
displayed on each monitor varies.
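
How the number of divided screen areas might follow the number of images can be sketched as below. The grid rule used here (the smallest near-square grid that holds n images, so one image fills the screen, two share it side by side, and three or four use a 2 x 2 grid) is an assumption made for illustration; the source only states that the number of displayed images varies. The function name `divided_areas` and the pixel dimensions are likewise hypothetical.

```python
import math
from typing import List, Tuple

Area = Tuple[int, int, int, int]  # (x, y, width, height) in pixels


def divided_areas(n: int, width: int, height: int) -> List[Area]:
    """Return one screen area per image for n images on a single screen."""
    if n < 1:
        raise ValueError("at least one image is required")
    cols = math.ceil(math.sqrt(n))   # columns of the grid
    rows = math.ceil(n / cols)       # rows needed to hold n images
    w, h = width // cols, height // rows
    return [((i % cols) * w, (i // cols) * h, w, h) for i in range(n)]
```

Under this rule, four images on a 640 x 480 screen each occupy a 320 x 240 quadrant, while a single image occupies the full screen.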

The image combining unit 172 may be adapted
for generating image data of speaking locations
detected by the speaking location detecting unit
171 for listening locations where speeches are not
detected by the speaking location detecting unit
171, the image data being contained in one screen,
and the image distributing unit 173 may be
adapted for distributing the image data to television
conference terminal units 161-i of the listening
locations.

In this case, for listening locations, the image
combining unit 172 generates image data that
contains only images of speaking locations in one
screen. The image distributing unit 173 distributes
the image data to terminal units 161-i at listening
locations.

Thus, when there are a plurality of speaking
locations, images of the speaking locations are
simultaneously displayed at terminal units 161-i at
listening locations.

In addition, the image combining unit 172 may
be adapted for generating image data of other
speaking locations and a former speaking location
for each of the speaking locations, the image data
being contained in one screen, and the image
distributing unit 173 may be adapted for
transmitting the image data to television conference
terminal units 161-i of the speaking locations.

In this case, for speaking locations, the image
combining unit 172 generates image data that
contains images of other speaking locations and an
image of a former speaking location in one screen.
The image distributing unit 173 distributes the
image data to each of the speaking locations.

Thus, each of the television conference termi- 
nal units 161-i at the speaking locations displays 
images of other speaking locations and an image 
of the former speaking location at a time. 
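
The selection of images for each location described above can be sketched as follows. This is only an illustrative sketch, not the claimed apparatus: the function name `compose_views`, its parameters, and the use of location letters as keys are assumptions introduced here.

```python
from typing import Dict, List, Optional


def compose_views(
    locations: List[str],
    speaking: List[str],
    former_speaker: Optional[str] = None,
) -> Dict[str, List[str]]:
    """Return, for each location, the locations shown on its divided screen.

    Hypothetical helper: a listening location sees every speaking location;
    a speaking location sees the other speaking locations plus the former
    speaking location, if one is known.
    """
    views: Dict[str, List[str]] = {}
    for loc in locations:
        if loc in speaking:
            # A speaking location does not need to see its own image.
            shown = [s for s in speaking if s != loc]
            if (
                former_speaker is not None
                and former_speaker != loc
                and former_speaker not in shown
            ):
                shown.append(former_speaker)
        else:
            # A listening location sees all speaking locations.
            shown = list(speaking)
        views[loc] = shown
    return views


views = compose_views(list("ABCDE"), ["A", "B", "C", "D"], former_speaker="E")
```

With speeches at A, B, C, and D and E as the former speaking location, `views["E"]` lists A to D and `views["A"]` lists B, C, D, and E, matching the four-area screens described in the abstract.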

Fig. 6 is a block diagram showing the principle
of construction according to a second aspect of the
present invention.

The second aspect of the present invention is a
multi-location control apparatus 230 for use with a
multi-location television conference system that has
a plurality of multi-location television conference
terminal units 200-i (where i = 1, 2, ..., and N)
disposed at three or more locations, each of the
television conference terminal units being adapted
for transmitting and receiving sound data and
image data, the multi-location control apparatus
being adapted for switching and controlling sound
data and image data transmitted among all of the
television conference terminal units 200-i.

The multi-location control apparatus comprises
switching and outputting unit 231 for inputting
image data of the conference rooms from all of the
television conference terminal units 200-1, 200-2,
..., and 200-N and for switching and outputting
image data of any conference room to each of the
television conference terminal units 200-i
independently of other television conference
terminal units 200-j (where j ≠ i, j = 1, 2, ...,
and N), and control unit 232 for controlling the
switching and outputting unit 231 so that when a
switching request for displaying an image of a
desired conference room is issued from a particular
television conference terminal unit 200-i, image
data to be transmitted to this particular television
conference terminal unit 200-i is switched to the
image data desired by the particular television
conference terminal unit 200-i.

According to the second aspect of the present
invention, when a television conference terminal
unit 200-i (where i = 1, 2, ..., and N) issues an
image switching request to the control unit 232, the
control unit 232 controls the switching and
outputting unit 231 so that only image data to be
transmitted to the terminal 200-i is switched to
image data of a conference room designated by this
request.

Thus, when the television conference terminal
unit 200-i of a conference room is operated so as
to issue an image switching request for a desired
conference room, only the image displayed at the
conference room that issued the request is switched
to the image of the desired conference room. At
this point, images displayed at other conference
rooms are not switched.
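
The per-terminal switching of the second aspect can be illustrated with a short sketch. The class `PerTerminalSwitch` and its method names are hypothetical and stand in loosely for the switching and outputting unit 231 and the control unit 232; the real apparatus switches encoded image streams, not room labels.

```python
from typing import Dict, Iterable, Optional


class PerTerminalSwitch:
    """Hypothetical model: each room independently selects the room it views."""

    def __init__(self, rooms: Iterable[str]) -> None:
        self.rooms = set(rooms)
        # Requesting room -> room whose image is currently sent to it.
        self.selected: Dict[str, str] = {}

    def request_switch(self, requester: str, desired: str) -> None:
        """Switch only the image sent to `requester`; other rooms are untouched."""
        if requester not in self.rooms or desired not in self.rooms:
            raise ValueError("unknown conference room")
        self.selected[requester] = desired

    def source_for(self, room: str) -> Optional[str]:
        """Return the room whose image is sent to `room`, or None if unset."""
        return self.selected.get(room)


switch = PerTerminalSwitch(["A", "B", "C"])
switch.request_switch("A", "C")  # Room A now views room C; B and C are unchanged.
```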

Fig. 7 is a block diagram showing a principle of
construction according to a third aspect of the
present invention. The third aspect of the present
invention is a multi-location control apparatus for
use with a multi-location television conference
system that has a plurality of multi-location
television conference terminal units 340-i (where
i = 1, 2, ..., and N) disposed at three or more
locations, each of the television conference
terminal units being adapted for transmitting and
receiving sound data and image data, the
multi-location control apparatus being adapted for
switching and controlling sound data and image data
transmitted among all of the television conference
terminal units 340-i.

The multi-location control apparatus has a first
mode and a second mode, the first mode being
adapted for selecting image data of a conference
room displayed at a television conference terminal
unit 340-i independently of other television
conference terminal units 340-j (where j ≠ i,
j = 1, 2, ..., and N), the second mode being
adapted for forcedly selecting image data of a
conference room displayed at a television
conference terminal unit 340-j corresponding to an
occurrence of a predetermined event of another
television conference terminal unit 340-i.

The multi-location control apparatus comprises
mode designating unit 351 for designating either
the first mode or the second mode to each of the
conference rooms corresponding to a mode
designating request received from a television
conference terminal unit 340-i of each of the
conference rooms, switching and outputting unit 352
for inputting image data of the conference rooms
from all of the television conference terminal units
340-1, 340-2, ..., and 340-N and for switching and
outputting image data of any conference room to
each of the television conference terminal units
340-i independently of other television conference
terminal units 340-j (where j ≠ i, j = 1, 2, ...,
and N), first control unit 353 for controlling the
switching and outputting unit 352 so that when a
switching request for displaying an image of a
desired conference room is issued from a particular
television conference terminal unit 340-i that has
been designated to the first mode, image data to be
transmitted to this particular television conference
terminal unit 340-i is switched to the image data
desired by the particular television conference
terminal unit 340-i, and second control unit 354 for
controlling the switching and outputting unit 352 so
that when the event takes place in a particular
television conference terminal unit 340-i, image
data to be transmitted to television conference
terminal units 340-k (where k = 1, 2, ..., and N)
that have been designated to the second mode is
switched to image data of the television conference
terminal unit 340-i.

The predetermined event is a detection of a
speaking conference room, an image switching
request operation performed by a television
conference terminal unit 340-i at a conference room,
or the like.

The operation of the third aspect of the present
invention is as follows.

When a mode designating operation is performed
with a television conference terminal unit 340-i at
a particular conference room, the mode designating
unit 351 designates the conference room to the
designated mode.

When an image switching request for a desired
conference room is issued with a television
conference terminal unit 340-i at a conference room
that has been designated to the first mode, the first
control unit 353 controls the switching and
outputting unit 352 so that only image data to be
sent to the television conference terminal unit
340-i of the conference room that has issued the
switching request is switched to image data of the
requested conference room.

When a particular event (such as speech or
image switching operation) takes place in a
television conference terminal unit 340-i at a
particular conference room, the second control unit
354 controls the switching and outputting unit 352
so that only image data to be sent to conference
rooms that have been designated to the second mode
is switched to image data of the conference room
corresponding to the event.

Thus, when a television conference terminal
unit 340-i is operated at a conference room that
has been designated to the first mode, an image of
a desired conference room can be selected
independently of the other conference rooms. In
addition, the image of the desired conference room
is not switched even if a speech occurs or an image
switching operation is performed at another
conference room.
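
The two modes of the third aspect can be sketched as a small controller. The names `Mode` and `ModeSwitchController` are hypothetical, as are two assumptions not stated in the text: every room starts in the second mode, and the room where the event occurs keeps its own current image.

```python
from enum import Enum
from typing import Dict, Iterable, Optional


class Mode(Enum):
    FIRST = 1   # The room selects its displayed image independently.
    SECOND = 2  # The displayed image is forcedly switched on an event.


class ModeSwitchController:
    """Hypothetical sketch of units 351 to 354 operating on room labels."""

    def __init__(self, rooms: Iterable[str]) -> None:
        rooms = list(rooms)
        # Assumption: every room starts in the second mode.
        self.mode: Dict[str, Mode] = {r: Mode.SECOND for r in rooms}
        self.shown: Dict[str, Optional[str]] = {r: None for r in rooms}

    def designate_mode(self, room: str, mode: Mode) -> None:
        """Role of the mode designating unit 351."""
        self.mode[room] = mode

    def request_switch(self, requester: str, desired: str) -> bool:
        """Role of the first control unit 353: honour first-mode requests only."""
        if self.mode[requester] is Mode.FIRST:
            self.shown[requester] = desired
            return True
        return False

    def on_event(self, source_room: str) -> None:
        """Role of the second control unit 354: switch second-mode rooms."""
        for room, mode in self.mode.items():
            # Assumption: the source room is not switched to its own image.
            if mode is Mode.SECOND and room != source_room:
                self.shown[room] = source_room


ctrl = ModeSwitchController(["A", "B", "C"])
ctrl.designate_mode("A", Mode.FIRST)
ctrl.request_switch("A", "C")  # Room A chooses room C for itself.
ctrl.on_event("B")             # A speech at B switches only second-mode rooms.
```

After these calls, room A still shows room C, while room C (in the second mode) shows room B.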

In this construction, the mode designating unit
351 may be adapted for designating an automatic
switching enable mode to a conference room when
receiving an automatic switching request from a
television conference terminal unit 340-i of the
conference room that has been designated to the
second mode, and the second control unit 354 may be
adapted for controlling the switching and outputting
unit 352 so that when a speech is detected by a
television conference terminal unit 340-i at a
particular conference room, image data to be
transmitted to television conference terminal units
340-k (where k = 1, 2, ..., and N) of conference
rooms that have been designated to the second mode
and the automatic switching enable mode is switched
to image data received from the television
conference terminal unit 340-i.

In this case, when an automatic switching en- 
able designating operation is performed at a televi- 
sion conference terminal unit 340-i at a conference 
room that has been designated to the second 
mode, the mode designating unit 351 designates 
this conference room to this mode. 

When a speech is detected at a conference 
room in the case that image data is sent to terminal 
units 340-i of conference rooms that have been 
designated to the first mode or the second mode 
and that have been designated to the automatic 
switching mode, the second control unit 354 con- 
trols the switching and outputting unit 352 in the 
same manner as the case that such an event is 
detected. However, the second control unit 354 
controls the switching and outputting unit 352 so 
that image data of conference rooms where 
speeches are detected is sent to terminal units 
340-j at conference rooms that have been des- 
ignated to the second mode, but not the automatic 
switching enable mode. 

Thus, in a television conference terminal unit
340-i at a conference room that has been designated
to the second mode and the automatic switching
enable mode, only when a speech is detected, an
image of this speaking location is automatically
displayed.

Fig. 7 also shows the principle of construction
of the fourth and fifth aspects of the present
invention.

The fourth aspect of the present invention is
the multi-location control apparatus of the third
aspect having units 351, 352, 353, and 354, further
comprising main conference room designating unit
355 for selecting and designating a main conference
room from all of the conference rooms and mode
changing unit 356 for forcedly changing modes of
all of the conference rooms managed by the mode
designating unit 351 to either the first mode or the
second mode designated by a television conference
terminal unit 340-i at a particular conference room
when the particular conference room is the main
conference room.

According to the fourth aspect of the present
invention, the main conference room designating
unit 355 selects and designates a main conference
room from all the conference rooms in the system.

When a forced mode change request to the
first mode or the second mode is issued by an
operation of a television conference terminal unit
340-i at a particular conference room, the mode
changing unit 356 accepts this request only when
this conference room has been designated as the
main conference room. After accepting this request,
the mode changing unit 356 forcedly changes the
present modes of all other conference rooms to the
designated mode.

Thus, at the conference room that has been
designated as the main conference room, the
modes of all other conference rooms in the system
can be forcedly changed to a desired mode by the
television conference terminal unit 340-i.
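
The acceptance rule of the mode changing unit 356 can be sketched in a few lines. The function `force_mode_change` and the string mode labels are hypothetical, chosen only to illustrate that a request is honoured solely when it comes from the main conference room.

```python
from typing import Dict


def force_mode_change(
    modes: Dict[str, str], main_room: str, requester: str, new_mode: str
) -> bool:
    """Apply `new_mode` to all other rooms if `requester` is the main room.

    Hypothetical helper: `modes` maps a room label to "first" or "second".
    Returns True when the request is accepted, False when it is rejected.
    """
    if requester != main_room:
        return False  # Only the main conference room may force a change.
    for room in modes:
        if room != main_room:
            modes[room] = new_mode
    return True


modes = {"A": "first", "B": "second", "C": "first"}
accepted = force_mode_change(modes, main_room="A", requester="A", new_mode="second")
```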

The fifth aspect of the present invention is the
multi-location control apparatus of the third aspect
having units 351, 352, 353, and 354 or of the fourth
aspect having units 351, 352, 353, 354, 355, and
356, further comprising displaying conference room
managing unit 357 for receiving switching
information from the first and second control units
353 and 354 so as to store the conference rooms
displayed on the television conference terminal
units 340-i at the respective conference rooms, and
transferring unit 358 for, when receiving camera
control information from the television conference
terminal unit 340-i at a particular conference room,
seeking the conference room whose image is being
displayed on that television conference terminal
unit 340-i with reference to information stored in
the displaying conference room managing unit 357
and transferring the camera control information to
the television conference terminal unit 340-j of the
conference room that has been found.

In this case, the displaying conference room
managing unit 357 receives switching information
of the switching and outputting unit 352 from the
first and second control units 353 and 354 and
stores data of conference rooms currently
displayed at each conference room.

Thus, when remote location camera control
information is sent from a conference room by an
operation of a television conference terminal unit
340-i, the transferring unit 358 references the
displaying conference room managing unit 357 and
transfers the remote location camera control
information to the television conference terminal
unit 340-j at the conference room whose image is
being displayed at the sending conference room.

Thus, cameras of conference rooms whose im- 
ages are displayed can be remotely controlled 
from television conference terminal units 340-i at all 
other conference rooms. 
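
The lookup performed for remote camera control can be sketched as follows. The class `DisplayedRoomManager` and the function `forward_camera_control` are hypothetical stand-ins for the displaying conference room managing unit 357 and the transferring unit 358, and the command string is an invented placeholder.

```python
from typing import Dict, Optional, Tuple


class DisplayedRoomManager:
    """Hypothetical store of which room's image each room currently displays."""

    def __init__(self) -> None:
        self._shown: Dict[str, str] = {}

    def record_switch(self, viewer: str, displayed: str) -> None:
        """Called with switching information from the control units."""
        self._shown[viewer] = displayed

    def displayed_at(self, viewer: str) -> Optional[str]:
        return self._shown.get(viewer)


def forward_camera_control(
    manager: DisplayedRoomManager, sender: str, command: str
) -> Optional[Tuple[str, str]]:
    """Return (target room, command), or None if the sender views no room."""
    target = manager.displayed_at(sender)
    if target is None:
        return None
    return (target, command)


manager = DisplayedRoomManager()
manager.record_switch(viewer="A", displayed="C")
routed = forward_camera_control(manager, sender="A", command="pan-left")
```

Here a camera command issued at room A is routed to room C, because room C is the room whose image room A is displaying.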

Fig. 8 is a block diagram showing a principle of 
construction of sixth and seventh aspects. 

The sixth aspect of the present invention is a
television conference terminal unit 460, comprising
image superimposing unit 461 for generating an
image signal where an image of background
information is superimposed over an image of a
conference room photographed by a camera, and
encoding unit 462 for encoding an image signal
generated by the image superimposing unit 461.

The seventh aspect of the present invention is
a television conference terminal unit 560,
comprising decoding unit 565 for decoding data
encoded by the encoding unit 462 of the television
conference terminal unit 460 of the sixth aspect and
image displaying unit 566 for displaying an image
corresponding to an image signal decoded by the
decoding unit 565.

In the television conference terminal unit 460, 
the image superimposing unit 461 superimposes 
an image signal of background information (such 
as a location name, a conference room name, a 
title of a speaking attendant, and so forth) over an 
image signal of a conference room photographed 
by a camera. The superimposed signal is encoded 
by the encoding unit 462. The encoded data is 
sent to the multi-location control apparatus 500. 

On the other hand, when the television con- 
ference terminal unit 560 receives encoded data 
from the multi-location control apparatus 500, the 
decoding unit 565 decodes the encoded data to an 
image signal. The image displaying unit 566 dis- 
plays the image signal as an image. 

Thus, when the multi-location television con- 
ference system is constructed of the television 
conference terminal units 460 and 560, an image of 
each conference room and background information 
(location name and the like) thereof can be dis- 
played at a time. 

Fig. 9 is a block diagram showing a principle of 
construction of eighth and ninth aspects of the 
present invention. 


The eighth aspect of the present invention is a
television conference terminal unit 670, comprising
encoding unit 671 for encoding an image signal of
a conference room photographed by a camera and
multiplexing unit 672 for multiplexing data encoded
by the encoding unit 671 and background
information code of the image of the conference room
photographed by the camera.

The ninth aspect of the present invention is a
television conference terminal unit 770, comprising
demultiplexing unit 775 for demultiplexing the
multiplexed data generated by the multiplexing unit
672 of the television conference terminal unit 670
into the encoded data of the image signal of the
conference room and the background information
code, image decoding unit 776 for decoding the
encoded data of the image signal of the conference
room demultiplexed by the demultiplexing unit 775
to an image signal, image generating unit 777 for
generating an image signal corresponding to the
background information code demultiplexed by the
demultiplexing unit 775, and image superimposing
unit 778 for superimposing the image signal
generated by the image generating unit 777 over the
image signal decoded by the image decoding unit
776.

In the television conference terminal unit 670,
the encoding unit 671 encodes an image signal of
a conference room that has been photographed by
a camera. The multiplexing unit 672 multiplexes
the encoded data with a code of background
information thereof. The multiplexed data is sent to
the television conference terminal units 770 of the
present invention through the multi-location control
apparatus 700.

When the television conference terminal unit
770 receives the multiplexed data, the
demultiplexing unit 775 demultiplexes the multiplexed
data into a code of background information of a
conference room and encoded data of an image signal
thereof. The image decoding unit 776 decodes the
encoded data to an image signal of a conference
room. The image generating unit 777 generates an
image signal corresponding to the code. The image
superimposing unit 778 superimposes the image
signal of the code over the image signal of the
conference room. The superimposed image is
displayed on a display unit.

Thus, when the multi-location television
conference system is constructed of the
above-mentioned two television conference terminal
units 670 and 770, an image of a conference room and
background information thereof can be displayed
as a superimposed image at each of the conference
rooms.
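
The multiplexing and demultiplexing of the eighth and ninth aspects can be illustrated with a minimal sketch. The text does not specify a frame format, so the length-prefixed layout below (a 2-byte code length and a 4-byte image length, big-endian) is purely an assumption, as are the function names `multiplex` and `demultiplex`.

```python
import struct
from typing import Tuple

# Assumed header: unsigned 16-bit code length, unsigned 32-bit image length.
_HEADER = ">HI"
_HEADER_SIZE = struct.calcsize(_HEADER)


def multiplex(encoded_image: bytes, background_code: bytes) -> bytes:
    """Combine encoded image data and a background information code."""
    header = struct.pack(_HEADER, len(background_code), len(encoded_image))
    return header + background_code + encoded_image


def demultiplex(frame: bytes) -> Tuple[bytes, bytes]:
    """Split a frame back into (encoded image data, background code)."""
    code_len, image_len = struct.unpack_from(_HEADER, frame, 0)
    code_end = _HEADER_SIZE + code_len
    code = frame[_HEADER_SIZE:code_end]
    image = frame[code_end:code_end + image_len]
    return image, code


frame = multiplex(b"\x01\x02\x03", b"TOKYO")
image, code = demultiplex(frame)
```

Carrying the background information as a code keeps the encoded image untouched; the receiving terminal generates the caption image from the code and superimposes it after decoding.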

Fig. 10 is a block diagram showing a principle 
of construction of tenth and eleventh aspects of the 
present invention. 
The tenth aspect of the present invention is a
multi-location control apparatus 880 for use with a
multi-location television conference system that has
a plurality of multi-location television conference 
terminal units disposed at three or more locations, 
each of the television conference terminal units 
being adapted for transmitting and receiving sound 
data and image data, the multi-location control ap- 
paratus 880 being adapted for switching and con- 
trolling sound data and image data transmitted 
among all of the television conference terminal 
units, the multi-location control apparatus compris- 
ing code generating unit 881 for generating a back- 
ground information code corresponding to a back- 
ground information number according to an image 
of a conference room that has been encoded and 
received from a television conference terminal unit 
of the conference room that has been switched and 
selected, and multiplexing and distributing unit 882 
for multiplexing the encoded data of the image 
signal of the conference room received from the 
television conference terminal unit of the confer- 
ence room that has been switched and selected 
and the code generated by the code generating 
unit 881 and for distributing the multiplexed signal 
to all the television conference terminal units. 

The code generating unit 881 references a 
database that stores conference room numbers and 
corresponding background information (for exam- 
ple, location names) in a manner of one-to-one 
correspondence so as to convert a conference 
room number to the corresponding background in- 
formation. 

The eleventh aspect of the present invention is 
a television conference terminal unit 980, compris- 
ing signal demultiplexing unit 985 for demultiplex- 
ing the multiplexed signal received from the multi- 
location control apparatus 880 into the encoded 
data of the image signal of the conference room 
and the background information code of the image 
of the conference room, image decoding unit 986 
for decoding the encoded data demultiplexed by 
the signal demultiplexing unit 985 into the image 
signal of the conference room, image generating 
unit 987 for generating the image signal of the 
background information corresponding to the code 
demultiplexed by the signal demultiplexing unit 
985, and image superimposing unit 988 for super- 
imposing the image signal generated by the back- 
ground information image generating unit 987 over 
the image signal decoded by the image decoding 
unit 986. 

In the multi-location control apparatus 880, the
code generating unit 881 generates a code of
background information corresponding to a
background information number received from a
television conference terminal unit 800. The
multiplexing and distributing unit 882 multiplexes
the code and the encoded data of the image signal
and distributes the multiplexed signal to each of
the television conference terminal units 980.

In the television conference terminal unit 980,
the signal demultiplexing unit 985 demultiplexes
the multiplexed signal received from the
multi-location control apparatus into encoded data
of an image of a conference room that has been
switched and selected and a code of background
information thereof. The image decoding unit 986
decodes the encoded data. The image generating
unit 987 generates an image signal of the
background information corresponding to the code. The
image superimposing unit 988 superimposes the
image signal of the background information over
the image signal of the conference room. The
superimposed image is displayed on the display
unit.

Thus, when the multi-location television
conference system is constructed of the television
conference terminal units 980 and the
multi-location control apparatus 880, an image of a
conference room and background information can be
displayed as a superimposed image at each of the
conference rooms.

Fig. 11 is a block diagram showing a principle
of construction of twelfth and thirteenth aspects of
the present invention.

The twelfth aspect of the present invention is a
multi-location control apparatus for use with a
multi-location television conference system that has
a plurality of multi-location television conference
terminal units disposed at three or more locations,
each of the television conference terminal units
being adapted for transmitting and receiving sound
data and image data, the multi-location control
apparatus being adapted for switching and controlling
sound data and image data transmitted among all
of the television conference terminal units.

The multi-location control apparatus 990 of the
twelfth aspect comprises image generating unit 991
for generating an image signal of a background
information code corresponding to a background
information number according to an image of a
conference room that has been encoded and
received from a television conference terminal unit of
the conference room that has been switched and
selected, image decoding unit 992 for decoding the
encoded data of the image signal of the conference
room received from the television conference
terminal unit of the conference room that has been
switched and selected to an image signal, image
superimposing unit 993 for superimposing the
image signal generated by the image generating unit
991 over the image signal decoded by the image
decoding unit 992, and encoding unit 994 for
encoding the image signal generated by the image
superimposing unit 993.
The thirteenth aspect of the present invention
is a television conference terminal unit 1000,
comprising image decoding unit 1095 for decoding the
encoded data obtained by the encoding unit 994 of
the multi-location control apparatus 990 to an
image signal and image displaying unit 1096 for
displaying an image corresponding to the image
signal decoded by the image decoding unit 1095.

The code of background information is a loca- 
tion name, a conference room name, a section 
name of a speaking attendant, and the like. 

In the multi-location control apparatus 990, the 
image generating unit 991 generates an image 
signal of background information corresponding to 
a background information number received from a 
television conference terminal unit 1050 at a con- 
ference room that has been switched and selected. 
The image decoding unit 992 decodes encoded 
data of an image signal of a conference room 
received from a television conference terminal unit 
that has been switched and selected. The image 
superimposing unit 993 superimposes the image 
signal of the background information over the im- 
age signal of the conference room. The encoding 
unit 994 encodes the superimposed image signal. 
A predetermined distributing unit distributes the 
encoded data to each of the television conference 
terminal units 1000. 

In each of the television conference terminal 
units 1000, the image decoding unit 1095 decodes 
the encoded data into an image signal where an 
image signal of background information has been 
superimposed over an image of a conference 
room. The image displaying unit 1096 displays the 
superimposed image. 

Thus, when the multi-location television con- 
ference system is constructed of the multi-location 
control apparatus 990 and the television conference 
terminal units 1000, an image of a conference 
room and background information thereof can be 
displayed as a superimposed image at each of 
conference rooms. 

Fig. 12 is a schematic diagram showing an 
overall construction of a multi-location television 
conference system according to a first embodiment 
of the present invention. 

In this drawing, a multi-location control appara- 
tus 1100 is disposed in a center station. The multi- 
location control apparatus 1100 is connected to 
television conference terminal units 1200 (1200A, 
1200B, 1200C, 1200D, and 1200E) of conference 
rooms disposed at five locations A, B, C, D, and E 
through a network such as ISDN or high speed 
digital dedicated line. 

Fig. 13 is a block diagram showing a detailed 
construction of the multi-location television con- 
ference system. 


Each of the television conference terminal units
1200A to 1200E comprises a plurality of
microphones 1201, a microphone mixer 1202, a speaker
1203, a sound control unit 1204, a sound encoding
and decoding unit (sound CODEC) 1205, a camera
1206, a monitor 1207, an image CODEC 1208, and
a multiplexing and demultiplexing unit
(MUX/DMUX) 1209. The microphones 1201 are
directed to attendants in a conference room. The
microphone mixer 1202 mixes outputs of the
microphones 1201. The speaker 1203 outputs a
sound of another television conference terminal unit
1200. The sound control unit 1204 performs an
input/output sound switching operation, an echo
removing operation, or the like so as to prevent the
output sound of the speaker 1203 from entering the
microphones 1201. The sound CODEC 1205
encodes sound data received from the sound control
unit 1204 and decodes sound data sent to the
sound control unit 1204. The camera 1206
photographs the entire scene of the conference room,
attendants, and/or materials. The monitor 1207
displays images of other conference rooms received
from other television conference terminal units
1200. The image CODEC 1208 encodes image
data photographed by the camera 1206 and
decodes encoded data to an image to be displayed
on the monitor 1207. The MUX/DMUX unit 1209
multiplexes and demultiplexes an input signal and
an output signal of the sound CODEC 1205 and the
image CODEC 1208.

The television conference terminal units 1200A
to 1200E with the above-described construction are
connected to the multi-location control apparatus
1100. The multi-location control apparatus 1100
comprises a multiplexing and demultiplexing
(MUX/DMUX) unit 1101, a sound CODEC 1102, a
sound detecting unit 1103, a sound mixer 1104, an
image selecting unit 1105, and an image combining
unit 1106. The MUX/DMUX unit 1101 multiplexes
sound data and image data received from the
television conference terminal units 1200A to 1200E.
In addition, the MUX/DMUX unit 1101 demultiplexes
the multiplexed data into sound data and image
data to be sent to the television conference
terminal units 1200A to 1200E. The sound CODEC 1102
is connected to the MUX/DMUX unit 1101. The
sound detecting unit 1103 detects occurrences of
speeches at the locations A to E from the levels of
sound signals received from the sound CODECs
1102 of the television conference terminal units
1200A to 1200E. The sound mixer 1104 mixes the
sound signals of the speaking locations detected
by the sound detecting unit 1103 and sends the
mixed sound signal to the speakers 1203 of the
television conference terminal units 1200A to
1200E through the sound CODEC 1102 and the
MUX/DMUX unit 1101. The image selecting unit
1105 receives sound signals from the sound
detecting unit 1103, selects image data to be
combined into one screen from image data of the
locations A to E, and outputs image combining
information. The image combining unit 1106
generates image data composed of a plurality of
images of conference rooms received from the image
selecting unit 1105 corresponding to the image
combining information and distributes the
combined image data to television conference terminal
units 1200 of conference rooms at designated
locations through the MUX/DMUX unit 1101.

The sound detecting unit 1103 detects a maxi- 
mum number of speaking locations (for example, 
four locations), where images thereof can be dis- 
played on the monitors 1207 of the television con- 
ference terminal units 1200A to 1200E. The sound 
detecting unit 1103 detects whether or not a sound 
level of each location exceeds a predetermined 
threshold value. In addition, the sound detecting 
unit 1103 determines whether or not the number of 
speaking locations exceeds the maximum number 
of locations. When the number of speaking loca- 
tions exceeds the maximum number of locations, 
the speaking locations with larger sound levels are 
selected for the maximum number of locations. 
Thus, the locations A to E are categorized as 
speaking locations and listening locations. 
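
The selection rule described above can be sketched as follows. The function `select_speaking_locations`, the numeric sound levels, and the threshold value are assumptions made for illustration; the text does not state how sound levels are measured or how ties are resolved.

```python
from typing import Dict, List, Tuple


def select_speaking_locations(
    levels: Dict[str, float], threshold: float, max_locations: int = 4
) -> Tuple[List[str], List[str]]:
    """Categorize locations as speaking or listening.

    A location is a candidate when its sound level exceeds `threshold`.
    If there are more candidates than `max_locations`, those with the
    larger sound levels are kept.
    """
    candidates = [(loc, lvl) for loc, lvl in levels.items() if lvl > threshold]
    # Loudest first, so the slice keeps the locations with larger levels.
    candidates.sort(key=lambda pair: pair[1], reverse=True)
    speaking = sorted(loc for loc, _ in candidates[:max_locations])
    listening = sorted(set(levels) - set(speaking))
    return speaking, listening


levels = {"A": 0.9, "B": 0.8, "C": 0.7, "D": 0.6, "E": 0.55}
speaking, listening = select_speaking_locations(levels, threshold=0.5)
```

In this example all five locations exceed the threshold, so the four with the larger levels (A to D) are treated as speaking locations and E becomes a listening location.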

The image selecting unit 1105 and the image 
combining unit 1106 combine image data of all the 
speaking locations that have been selected into 
image data of one screen. 

Image data displayed at each speaking location 
differs from image data displayed at each listening 
location. Fig. 14 shows the input/output interface of
image data and sound data between the multi- 
location control apparatus 1100 and each of the 
television conference terminal units 1200A to 
1200E. In reality, as will be described later, an 
image that is output to each speaking location 
differs, location by location. 

Sound data that the sound mixer 1104 outputs 
to speaking locations differs from sound data that 
the sound mixer 1104 outputs to listening locations. 
Likewise, image data that the image combining unit
1106 outputs to the speaking locations differs from
image data that the image combining unit 1106
outputs to the listening locations. The sound mixer
1104 mixes sounds corresponding to images of the
locations A to E. The mixing operation of the sound
mixer 1104 is based on image combining
information received from the image selecting unit 1105.

Next, an example of a display method in the
case that attendants speak at the locations A to E
will be described. Fig. 15 shows display screens
(monitor screens) on the monitors 1207 of the
television conference terminal units 1200 in the
case that attendants at four locations (in this
case, locations A, B, C, and D) speak at the same
time. Fig. 15 (a) shows a display screen at a
listening location (in this case, location E).
Fig. 15 (b) shows a display screen at a speaking
location (in this case, location A).

In this case, since there are four speaking 
locations, the monitors 1207 at the locations A to E 
each display four divided screen images. In other 
words, the monitors 1207 at the listening locations
each display four divided screen images showing
images of four locations A, B, C, and D. The 
images displayed on the monitors 1207 at the 
listening locations are the same. 

On the other hand, the monitor 1207 at the
speaking location A displays images of three
speaking locations B, C, and D and an image of the
former speaking location E. As with the display
screen of Fig. 15 (b), the monitors 1207 at the
speaking locations B, C, and D display images of
speaking locations (other than the local location)
and an image of the former speaking location E.

Fig. 16 shows monitor screens in the case that
attendants speak at two locations (in this case, 
locations A and B). 

In this case, since there are two speaking
locations, the monitors 1207 each display divided
screen images of the two speaking locations.

As shown in Fig. 16 (a), the monitors 1207 at
listening locations C, D, and E each display divided
screen images of the speaking locations A and B.
On the other hand, as shown in Fig. 16 (b), the
monitor 1207 at the speaking location A displays
divided screen images of the speaking location B
and the former speaking location E. Likewise, the
monitor 1207 at the speaking location B displays
divided screen images of the other speaking
location and the former speaking location E.

Thus, in the case that there are two speaking
locations, the monitor 1207 at a listening location
displays divided screen images of two speaking
locations. On the other hand, the monitor 1207 at
a speaking location displays divided screen
images of the other speaking location and the
former speaking location.

Fig. 17 shows display screens of monitors in
the case that there is one speaking location
(location A).

In this case, the monitor screen is not divided.
As shown in Fig. 17 (a), the monitors 1207 at
listening locations B, C, D, and E each display an
image of the speaking location A only. On the other
hand, as shown in Fig. 17 (b), the monitor 1207 at
the speaking location A displays an image of the
former speaking location E only.

Thus, in the case that there is only one
speaking location, the monitors 1207 at the listening
locations each display an image of the speaking
location only. On the other hand, the monitor 1207
at the speaking location displays an image of the
former speaking location only.

In CCITT (Consultative Committee for
International Telephony and Telegraphy)
Recommendation H.261, which is a motion picture
communication image encoding standard using primary
subrates from 64 kb/s to 2 Mb/s, a common
intermediate format (CIF) has been defined as an image
codec standard for visual telephone and television
conference. In addition, for low-bit-rate simple
terminal units such as small visual telephones, QCIF
(Quarter CIF), where the horizontal and vertical
resolutions are each half those of the CIF, has also
been defined.

Fig. 18 is a schematic diagram for explaining a
method for transmitting image data from the
television conference terminal units 1200A to 1200E at
locations A to E to the multi-location control
apparatus 1100 with CIF. Fig. 19 is a schematic
diagram for explaining a method for displaying
images of four locations in four divided screen
areas at a time as shown in Fig. 15 (a).

The multi-location control apparatus 1100 is 
provided with a CIF/QCIF converter that converts 
image data of the locations A, B, C, and D from the
CIF format into the QCIF format. The QCIF image
data of the four locations are combined
into one screen so that the images of the four
locations A, B, C, and D shown in Fig. 15 (a) are 
displayed on one screen at a time. The combined 
data is sent to the television conference terminal 
unit 1200E at the location E. 
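
The conversion and combination described above can be sketched as follows. The CIF and QCIF luminance sizes (352 x 288 and 176 x 144) are those of H.261. Everything else is an assumption made for illustration: frames are modelled as plain lists of pixel rows, the CIF/QCIF conversion is done by simple decimation, the quadrant order is arbitrary, and the function names are hypothetical. A real apparatus would operate on decoded H.261 pictures and would normally filter before subsampling.

```python
from typing import List, Sequence

CIF_W, CIF_H = 352, 288      # H.261 CIF luminance resolution
QCIF_W, QCIF_H = 176, 144    # QCIF: half of CIF in each direction

Frame = List[list]           # a frame is a list of rows of pixel values


def cif_to_qcif(frame: Frame) -> Frame:
    """Halve both resolutions by keeping every second pixel and line
    (plain decimation, an assumption; no low-pass filtering)."""
    return [row[::2] for row in frame[::2]]


def combine_four(frames: Sequence[Frame]) -> Frame:
    """Tile four CIF frames into one CIF-sized frame.

    Assumed quadrant order: top-left, top-right, bottom-left,
    bottom-right.
    """
    if len(frames) != 4:
        raise ValueError("exactly four frames are required")
    q = [cif_to_qcif(f) for f in frames]
    top = [q[0][y] + q[1][y] for y in range(QCIF_H)]
    bottom = [q[2][y] + q[3][y] for y in range(QCIF_H)]
    return top + bottom


def solid_frame(value) -> Frame:
    """Hypothetical test pattern: a CIF frame filled with one value."""
    return [[value] * CIF_W for _ in range(CIF_H)]


if __name__ == "__main__":
    combined = combine_four([solid_frame(name) for name in "ABCD"])
    print(len(combined[0]), len(combined))
```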

When there is one speaking location as shown 
in Fig. 17, the multi-location control apparatus 1100 
selects and outputs the received image data of the 
CIF format to the television conference terminal 
units 1200 at the listening locations through the 
image selecting unit 1105 and the image combin- 
ing unit 1106, not through the CIF/QCIF converter. 

Next, with reference to flow charts shown in 
Figs. 20 to 22, an image output process (image 
data transmitting process) of the multi-location con- 
trol apparatus 1100 for the television conference 
terminal units 1200A to 1200E at the locations A to
E will be described. 

Fig. 20 is a main flow chart showing the image 
output process. 

In this drawing, the sound detecting unit 1103
detects the number of speaking locations (at step
S11). The multi-location control apparatus 1100
determines whether or not the number of speaking
locations is one according to the detected result (at
step S12). When there are a plurality of speaking
locations (determined result at step S12 is NO), the
multi-location control apparatus 1100 performs a
multi-image output process that causes images of
a plurality of locations to be displayed as divided
screen images on the monitors 1207 of the
television conference terminal units 1200 at the
locations A to E (at step S13). When there is one
speaking location (the determined result at step S12
is YES), the apparatus 1100 performs a single image
output process that causes an image of one speaking
location to be displayed on the monitors 1207 at
the locations A to E (at step S14).

Fig. 21 is a flow chart showing the multi-image 
output process (S13).

In this process, images of a plurality of
speaking locations are combined into one frame (at step
S131). Thus, the images shown in Figs. 15 (a) and
16 (a) are formed.

The apparatus 1100 determines whether or not
the output destination of the image data is a current
speaking location (at step S132).

When the output destination is a current
speaking location (determined result at step S132 is
YES), the apparatus 1100 replaces the image data of
that speaking location with image data of the former
speaking location (at step S133). Then, the
apparatus 1100 outputs (transmits) the divided screen
image data to the current speaking location (at
step S134).

Thus, when the speaking location is location A,
the divided screen image data shown in Fig. 15 (a)
or Fig. 16 (a) is replaced with the divided screen
image data shown in Fig. 15 (b) or Fig. 16 (b),
respectively. The apparatus 1100 outputs the
divided screen image data to the television
conference terminal unit 1200A at the location A.

When the output destination at step S132 is
a listening location rather than a current speaking
location, the apparatus 1100 outputs (transmits) the
image data combined at step S131 to this
listening location (at step S135).

Thus, the divided screen image data shown in
Fig. 15 (a) or Fig. 16 (a) is sent to the television
conference terminal unit 1200E at the location E,
which is a listening location.

Fig. 22 is a flow chart showing the single
image output process S14.

In this process, the apparatus 1100 determines
whether or not the output destination (transmission
destination) of the image data is a
current speaking location (at step S141). When
the output destination is a current speaking location
(determined result is YES), the apparatus 1100
outputs (transmits) single image data that is image
data of the former speaking location to the
television conference terminal unit 1200 at the current
speaking location (at step S142).

Thus, the apparatus 1100 transmits the single
image data of the former speaking location in the
format shown in Fig. 17 (b) to, for example, the
location A, which is a single speaking location.

On the other hand, when the output destination
is a listening location at step S141, the apparatus
1100 outputs (transmits) the single image data of
the current speaking location to the television
conference terminal unit 1200 at the listening location
(at step S143).

Thus, when only the location A is a speaking 
location, the apparatus 1100 outputs (transmits) 
single image data of the current speaking location 
A in the format shown in Fig. 17 (a) to the listening 
locations B, C, D, and E. 
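
The flow charts of Figs. 20 to 22 can be condensed into the following sketch. It only decides which locations' images are sent to each destination and does not handle pixel data. The function names and the list-of-names representation are hypothetical, and placing the former speaking location in the slot of the destination's own image is an assumption, since the text does not state the slot order.

```python
from typing import Dict, List, Sequence


def images_for_destination(
    dest: str, speakers: Sequence[str], former: str
) -> List[str]:
    """Return the locations whose images are sent to `dest`."""
    if len(speakers) > 1:                       # S12 is NO: multi-image (S13)
        combined = list(speakers)               # S131: combine into one frame
        if dest in speakers:                    # S132: current speaking location?
            # S133/S134: the destination's own image is replaced by the
            # image of the former speaking location.
            return [former if loc == dest else loc for loc in combined]
        return combined                         # S135: listening location
    # S12 is YES: single image output process (S14)
    if dest in speakers:                        # S141
        return [former]                         # S142
    return list(speakers)                       # S143


def image_output(
    locations: Sequence[str], speakers: Sequence[str], former: str
) -> Dict[str, List[str]]:
    """Apply the decision to every destination location."""
    return {d: images_for_destination(d, speakers, former) for d in locations}


if __name__ == "__main__":
    for case in (["A", "B", "C", "D"], ["A", "B"], ["A"]):
        print(image_output("ABCDE", case, former="E"))
```

The case in which no location is speaking is not covered by the flow charts; this sketch then returns an empty list for every destination.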

In this embodiment, the multi-location control 
apparatus 1100 detects speaking locations. How- 
ever, it should be appreciated that each of the 
television conference terminal units 1200 may de- 
tect speaking attendants at respective locations 
and send speech detecting data (along with sound
level data if necessary) to the multi-location control 
apparatus 1100. 

Fig. 23 is a block diagram showing an overall 
construction of a television conference system ac- 
cording to a second embodiment of the present 
invention. Television conference terminal units
1200' and a multi-location control apparatus 1100'
of this system partially differ from those of the
embodiment shown in Fig. 13. For simplicity, the
elements of Fig. 23 that are the same as those of
Fig. 13 are denoted by the same reference numerals.

The television conference terminal unit 1200' 
further comprises an operating unit 1301 and a 
data transmitting unit 1302. With the operating unit 
1301, the user of the television conference terminal 
unit 1200' can perform (1) displaying conference 
room switching operation, (2) designated mode 
switching operation, (3) automatic conference room 
switching enable/disable designating operation, (4) 
remote location camera control operation, and (5) 
designated mode forced changing operation. The 
data transmitting unit 1302 converts an operating 
signal received from the operating unit 1301 into a 
signal in a predetermined format and outputs the 
resultant signal to a multiplexing and demultiplexing
(MUX/DMUX) unit 1209'. The MUX/DMUX unit
1209' multiplexes image data received from an 
image CODEC 1208, sound data received from a 
sound CODEC 1205, and the operating data re- 
ceived from the data transmitting unit 1302 and 
then outputs the multiplexed data to the multi- 
location control apparatus 1100'. 

Each of the MUX/DMUX units 1101' of the
multi-location control apparatus 1100' has a data
processing unit 1107. The data processing unit 1107
extracts the operating data, which is received from
the television conference terminal units 1200A' to
1200X', from the MUX/DMUX unit 1101'. The data
processing unit 1107 outputs the extracted
operating data to the control unit 1106'.

Fig. 24 is a schematic diagram showing an 
inner construction of the image selecting unit 
1105'. 


As shown in this drawing, the image selecting
unit 1105' comprises n input terminals 1155 and n
output terminals 1156. The n input terminals 1155
receive image data from the conference rooms A,
B, C, ... , and X. The n output terminals 1156 send
image data to the conference rooms A, B, C, ... ,
and X. The n input terminals 1155 and the n output
terminals 1156 are connected to n input lines 1158
and n output lines 1159 of n x n switch elements 1157
that are disposed in a matrix shape. Thus, the n
input terminals 1155 can be connected to the n
output terminals 1156 through the switch
elements 1157 so that image data of any conference
room can be sent to the television conference
terminal units 1200' of the conference rooms A, B,
C, ... , and X. An array of the switch elements 1157
disposed in an n x n matrix shape is referred to as a
matrix switcher 1154. Since the image selecting
unit 1105' has such a construction, at each of the
conference rooms of the system according to this
embodiment, an independent designating mode
can be selected in addition to the above-described
NS/PS designating mode.
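
A minimal model of the matrix switcher 1154 is sketched below. The class and method names are hypothetical. The sketch also assumes that each output line carries exactly one input at a time, which is implied by each monitor showing one selected room but is not stated for the switch elements themselves.

```python
from typing import Dict, Iterable, Optional


class MatrixSwitcher:
    """n x n crosspoint matrix: on[destination][source] is True when
    the switch element joining that input and output line is on."""

    def __init__(self, rooms: Iterable[str]) -> None:
        self.rooms = list(rooms)
        self.on: Dict[str, Dict[str, bool]] = {
            dest: {src: False for src in self.rooms} for dest in self.rooms
        }

    def connect(self, source: str, destination: str) -> None:
        """Route the image of `source` to the monitor at `destination`.

        All other elements on the same output line are turned off
        (assumption: one input per output line).
        """
        if source not in self.on or destination not in self.on:
            raise KeyError("unknown conference room")
        for src in self.rooms:
            self.on[destination][src] = (src == source)

    def source_of(self, destination: str) -> Optional[str]:
        """Return the room currently routed to `destination`, if any."""
        for src, state in self.on[destination].items():
            if state:
                return src
        return None


if __name__ == "__main__":
    switcher = MatrixSwitcher(["A", "B", "C", "X"])
    switcher.connect("A", "B")   # room B watches room A
    switcher.connect("A", "C")   # one input may feed several outputs
    switcher.connect("X", "A")
    print({room: switcher.source_of(room) for room in switcher.rooms})
```

Because every output line is switched independently, one room can change what it watches without disturbing the others, which is what makes the independent designating mode possible.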

The independent designating mode is a mode
in which an image of a desired conference room
can be independently selected without an influence
of operations and speeches in other conference
rooms. In this embodiment, even in the
independent designating mode, when a speaking attendant
is detected, the user can enable or disable the
automatic NS switching mode.

In addition, a chairman conference room (main
conference room) that can forcibly switch the modes
of all other conference rooms to a desired mode
can be designated.

Fig. 25 is a schematic diagram showing an 
example of an operation of this embodiment in the 
case that both the NS/PS designating mode and
the independent designating mode are present.

In this drawing, the conference rooms A, B,
D, and E have been designated to the NS/PS
designating mode, whereas only the conference
room C has been designated to the independent
designating mode.

In this condition, when the conference room A
is designated to the NS mode, the monitors at the
conference rooms B, D, and E each display an
image of the conference room A as represented by
a solid arrow of the drawing. On the other hand, the
monitor at the conference room A displays an
image of the conference room D that has been
designated to the PS mode as represented by a
dotted arrow of the drawing. The monitor at the
conference room C that has been designated to the
independent designating mode displays an image
of the designated conference room E as
represented by a one-dashed arrow of the drawing.

Figs. 26 to 29 show various management
tables that the control unit 1106' provides. With these
management tables, the control unit 1106' controls
image data to be transmitted to the television
conference terminal units 1200A' to 1200X'.

Fig. 26 is a schematic diagram showing a 
construction of a mode management table for man- 
aging modes designated to the conference rooms 
A, B, ... , and X. In addition to the above-described 
"designated mode", this table contains "automatic 
conference room switching flag" and "chairman 
flag". The automatic conference room switching 
flag represents whether the automatic switching of 
an image of a conference room is enabled or 
disabled when a speaking attendant is detected 
under the independent designating mode. The 
chairman flag represents whether each conference 
room is a chairman room. In these flags, the auto- 
matic conference room switching flag is valid only 
for conference rooms that have been designated to 
the independent designating mode. 

Fig. 27 is a schematic diagram showing a 
construction of an NS/PS management table. This 
table is used to manage whether each of the con- 
ference rooms has been designated to "NS", "PS", 
or "S" mode. The "S" mode represents a conference
room that has been designated to a mode other
than "NS" and "PS" modes.

Fig. 28 is a schematic diagram showing a 
construction of a displaying conference room table 
that is used to manage conference rooms currently 
displayed on the monitors 1207 at the conference 
rooms. 

Fig. 29 is a schematic diagram showing a 
construction of a matrix switcher status table that is 
used to manage current statuses of the switch 
elements 1157 of the matrix switcher 1154. In the 
drawing, mark "O" represents on status, whereas 
mark "X" represents off status. 

In this embodiment of the present invention, 
the multi-location control apparatus 1100' of the 
center station receives the following events (1) to 
(6) from the television conference terminal units 
1200A', 1200B', ... , and 1200X' at the conference
rooms A, B, ... , and X. 

(1) Speaking attendant detecting event
(2) Displaying conference room switching event
(3) Designated mode switching event
(4) Automatic conference room switching
enable/disable designating event
(5) Remote location camera control event
(6) Designated mode forced changing event

Next, with reference to flow charts, operations
of the control unit 1106' corresponding to
occurrences of these events will be described.

Before such events take place, parameters in 
these tables have been designated as shown in 
Figs. 26 to 29. 
In other words, as shown in the mode
management table of Fig. 26, the conference rooms A, D,
... , and X have been designated to the NS/PS
designating mode and the conference rooms B and
C have been designated to the independent
designating mode. The automatic conference room
switching flag for the conference room B has been
designated to the enable state. The automatic
conference room switching flag for the conference
room C has been designated to the disable state.
The conference room A has been designated as a
chairman conference room.

As shown in the NS/PS management table of
Fig. 27, the conference room A has been
designated to "NS" mode and the conference room D
has been designated to "PS" mode. The other
conference rooms have been designated to "S"
mode.

In addition, as shown in the displaying
conference room management table of Fig. 28, the
television conference terminal units 1200D', ... ,
and 1200X' at the conference rooms D, ... , and X
display an image of the conference room A that
has been designated to "NS" mode. On the other
hand, the television conference terminal unit
1200A' that has been designated to "NS" mode
displays an image of the conference room D that
has been designated to "PS" mode. The television
conference terminal units 1200B' and 1200C' at the
conference rooms B and C display images of
the conference rooms C and D that have been
selected at the conference rooms B and C,
respectively.

Fig. 29 shows statuses of the switch elements
1157 of the matrix switcher 1154 corresponding to
the above-described displaying modes.
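
The initial contents of the four management tables, as the text above describes them, can be written down as plain data. This is a sketch only: the room list is a hypothetical subset of the rooms A to X, the dictionary layout and function name are invented for illustration, and the row/column orientation of the matrix switcher status table in Fig. 29 is assumed (one row per destination room).

```python
ROOMS = ["A", "B", "C", "D", "E", "X"]   # hypothetical subset of rooms A..X

# Mode management table (Fig. 26). The automatic conference room
# switching flag is meaningful only in the independent designating mode,
# so it is None elsewhere (representation assumed).
mode_table = {
    room: {"mode": "NS/PS", "auto_switch": None, "chairman": False}
    for room in ROOMS
}
mode_table["A"]["chairman"] = True
mode_table["B"].update(mode="independent", auto_switch=True)
mode_table["C"].update(mode="independent", auto_switch=False)

# NS/PS management table (Fig. 27).
nsps_table = {room: "S" for room in ROOMS}
nsps_table["A"] = "NS"
nsps_table["D"] = "PS"

# Displaying conference room management table (Fig. 28):
# destination room -> room whose image its monitor shows.
display_table = {"A": "D", "B": "C", "C": "D", "D": "A", "E": "A", "X": "A"}


def build_switch_status(display, rooms):
    """Derive the matrix switcher status table (Fig. 29) from the
    displaying table: "O" marks an element that is on, "X" one that
    is off."""
    return {
        dest: {src: ("O" if display[dest] == src else "X") for src in rooms}
        for dest in rooms
    }


switch_status = build_switch_status(display_table, ROOMS)

if __name__ == "__main__":
    for dest in ROOMS:
        print(dest, " ".join(switch_status[dest][src] for src in ROOMS))
```

Deriving the switch status from the displaying table, rather than editing it separately, mirrors the order in which the control unit 1106' is described as rewriting the tables.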

Next, with reference to a flow chart shown in
Fig. 30, the operation of the control unit 1106'
corresponding to an occurrence of (1) speaking
attendant detecting event will be described.

When the control unit 1106' receives a speech
detecting signal from the sound detecting unit
1103, the control unit 1106' determines whether or
not the informed speaking attendant (the speaking
conference room) has been designated to "NS"
mode with reference to the NS/PS management
table (at step S201).

When the speaking conference room has not been
designated to NS mode, the control unit 1106'
rewrites the NS/PS management table so that the
speaking conference room is designated to NS
mode and the conference room that has been
designated to NS mode is designated to PS mode (at
step S202).

Thus, when an attendant speaks at the
conference room X, the control unit 1106' rewrites the
NS/PS management table as shown in Fig. 31 (a).
In other words, as the conference room X is
changed to NS mode, the conference room A,
which has been designated to NS mode, is
changed to PS mode, and the conference room D,
which has been designated to PS mode, is
changed to S mode.

The control unit 1106' initializes the conference
room number to "1" (at step S203). The control
unit 1106' determines the current designated mode
of the conference room with this conference room
number with reference to the mode management
table (at step S204). When the conference room
has been designated to the NS/PS designating
mode, since the image of the conference room is
switched, the control unit 1106' correspondingly
rewrites the displaying conference room
management table (at step S205).

When the conference room has been desig- 
nated to the independent designating mode (at 
step S204), the control unit 1106' determines 
whether the conference room has been designated 
to the automatic conference room switching enable 
mode with reference to the automatic conference 
room switching flag (at step S206). 

When the conference room has been
designated to the automatic conference room switching
enable mode, since the image displayed at the
conference room is switched to an image of the
conference room newly designated to NS mode, the
control unit 1106' correspondingly rewrites the
displaying conference room table (at step S205). When
the conference room has been designated to the
automatic conference room switching disable mode,
since the image of the conference room is not
switched, the control unit 1106' does not rewrite
the displaying conference room table.

The control unit 1106' repeats the above-de- 
scribed steps S204 to S206 for all the conference 
rooms in the ascending order of the conference 
room numbers (at step S207 and S208). 

Thus, the control unit 1106' rewrites the
displaying conference room management table so that
the contents as shown in Fig. 28 are changed to
those as shown in Fig. 31 (b). In other words, the
displaying conference room management table is
rewritten so that the monitor at the conference
room A that is designated to PS mode displays an
image of the conference room X that is designated
to NS mode and the monitor at the conference
room D, whose mode is changed from PS to
S, displays an image of the conference room X that
is designated to NS mode. In addition, since the
conference room B has been designated to the
independent designating mode and the automatic
conference room switching enable mode, the
displaying conference room management table is
rewritten so that the monitor at the conference room
B displays an image of the conference room X that
is designated to NS mode. In addition, the
displaying conference room management table is rewritten
so that the monitors at the remaining conference
rooms E and so forth (not shown) each display
an image of the conference room X. Since the
monitor at the conference room X already displays an
image of the conference room A that is newly
designated to PS mode, the column of the conference
room X of the displaying conference room
management table is not rewritten.

After the displaying conference room
management table has been rewritten, the control unit
1106' rewrites the matrix switcher status table
corresponding to the contents of the displaying
conference room management table (at step S208).

Thus, the matrix switcher status table is
rewritten so that the contents as shown in Fig. 29 are
changed to those as shown in Fig. 31 (c).

When the speaking conference room has been
designated to NS mode at step S201, since NS
and PS modes are not changed, the control unit
1106' immediately terminates the process without
rewriting the matrix switcher status table. Thus,
images displayed on the monitors 1207 at the
conference rooms A, B, ... , and X are not
switched.
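
The handling of the speaking attendant detecting event (Fig. 30) can be sketched as follows. The function name and the dictionary-based tables are hypothetical. Demoting the former PS room to S mode follows the Fig. 31 (a) example rather than the wording of step S202. What a speaking room in the independent designating mode with automatic switching enabled should display is not stated; the sketch shows it the new PS room, which is an assumption.

```python
from typing import Dict, Sequence


def on_speaker_detected(
    speaker: str,
    rooms: Sequence[str],
    mode: Dict[str, str],          # room -> "NS/PS" or "independent"
    auto_switch: Dict[str, bool],  # independent rooms -> enable flag
    nsps: Dict[str, str],          # room -> "NS", "PS" or "S"
    display: Dict[str, str],       # room -> room shown on its monitor
) -> bool:
    """Update the tables in place; return False if nothing changed."""
    if nsps[speaker] == "NS":                     # S201
        return False
    # S202: former PS becomes S, former NS becomes PS, speaker becomes NS.
    for room in rooms:
        if nsps[room] == "PS":
            nsps[room] = "S"
    for room in rooms:
        if nsps[room] == "NS":
            nsps[room] = "PS"
    nsps[speaker] = "NS"
    new_ps = next((r for r in rooms if nsps[r] == "PS"), None)

    for room in rooms:                            # S203, S207, S208
        follows = (
            mode[room] == "NS/PS"                 # S204
            or auto_switch.get(room, False)       # S206
        )
        if not follows:
            continue                              # display is left as it is
        if room == speaker:
            if new_ps is not None:
                display[room] = new_ps            # the NS room watches PS
        else:
            display[room] = speaker               # S205
    return True


if __name__ == "__main__":
    rooms = ["A", "B", "C", "D", "E", "X"]
    mode = {r: "NS/PS" for r in rooms}
    mode.update(B="independent", C="independent")
    auto = {"B": True, "C": False}
    nsps = {"A": "NS", "B": "S", "C": "S", "D": "PS", "E": "S", "X": "S"}
    display = {"A": "D", "B": "C", "C": "D", "D": "A", "E": "A", "X": "A"}
    on_speaker_detected("X", rooms, mode, auto, nsps, display)
    print(nsps)
    print(display)
```

After the call, the matrix switcher status table would be regenerated from the displaying table, as the text describes.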

Next, with reference to a flow chart shown in
Fig. 32, the operation of the control unit 1106'
corresponding to an occurrence of (2) displaying
conference room switching event will be described.

When a displaying conference room switching
operation is performed at any conference room
with the operating unit 1301 of the television
conference terminal unit 1200', the control unit 1106'
determines the mode of the operating conference
room with reference to the mode management
table (at step S221).

When the operating conference room has been
designated to the independent designating mode,
the control unit 1106' rewrites the displaying
conference room management table so that the
column of this conference room of this table
represents the designated conference room (at step
S222). In addition, the control unit 1106' rewrites
the matrix switcher status table corresponding to
the contents of the rewritten displaying conference
room management table (at step S231).

Thus, in the case that the contents of these
tables are as shown in Figs. 26 to 29, when the
user in the conference room B switches the
monitor 1207 to an image of the conference room X, the
displaying conference room management table is
rewritten so that only the column of the conference
room B of this table represents the conference
room X instead of the conference room C. Thus,
the matrix switcher status table is rewritten so that
the contents as shown in Fig. 29 are changed to
the contents shown in Fig. 33 (b).

Thus, the user at a conference room that has
been designated to the independent designating
mode can switch the image displayed on the user's
monitor 1207 to an image of a desired conference
room independently of the other conference rooms.

When the operating conference room (at step 
S221) has been designated to the NS/PS designat- 
ing mode, the control unit 1106' determines wheth- 
er or not the conference room has been designated 
to NS mode with reference to the NS/PS manage- 
ment table (at step S223). When the operating 
conference room has been designated to NS 
mode, the control unit 1106' rewrites the NS/PS 
management table so that the column of the des- 
ignated conference room of this table represents 
PS mode and the column of a conference room 
that has been designated to PS mode represents S 
mode (at step S224). In other words, since a monitor
at a conference room that has been designated
to NS mode displays an image of a conference 
room that has been designated to PS mode, the 
control unit 1106' correspondingly rewrites the 
NS/PS management table. 

Thus, when the user at the conference room A 
that has been designated to NS mode performs a 
switching operation that displays an image of the 
conference room X on the user's monitor 1207, the 
NS/PS management table is rewritten so that the 
contents as shown in Fig. 27 are changed to those 
as shown in Fig. 34 (a). 

On the other hand, when the operating con- 
ference room (at step S223) has been designated 
to PS or S, the control unit 1106' rewrites the 
NS/PS management table so that the column of the 
designated conference room represents NS, the 
column of the conference room that has been 
designated to NS mode represents PS, and the 
column of the conference room that has been 
designated to PS mode represents S (at step 
S225). 

Thus, when the user at the conference room D 
that has been designated to PS mode performs a 
switching operation that displays an image of the 
conference room X on the user's monitor 1207, the 
NS/PS management table is rewritten so that the 
contents as shown in Fig. 27 are changed to those 
as shown in Fig. 35 (a). 

After rewriting the NS/PS management table, 
the control unit 1106' initializes the conference 
room number to "1" (at step S226). 

The control unit 1106' determines whether the
conference room corresponding to the conference
room number has been designated to the NS/PS
designating mode or the independent designating
mode with reference to the mode management
table (at step S227). When the conference room
has been designated to the NS/PS designating
mode, the control unit 1106' rewrites the column of
the conference room of the displaying conference
room management table corresponding to the
contents of the NS/PS management table (at step
S228). In other words, the control unit 1106'
rewrites the displaying conference room management
table so that the monitor of the conference room
that has been designated to NS mode displays an
image of the conference room that has been
designated to PS mode and monitors at other conference
rooms each display an image of the conference
room that has been newly designated to NS mode.

When the conference room (at step S227) has
been designated to the independent designating
mode, since this conference room is not affected
by a switching operation of another conference
room, the control unit 1106' does not rewrite the
displaying conference room management table.

The control unit 1106' repeats the
above-described steps S227 to S228 for all the conference
rooms in the ascending order of the conference
room numbers (at steps S229 and S230). After
completing this process, the control unit 1106'
rewrites the matrix switcher status table
corresponding to the contents of the displaying conference
room management table (at step S231).

Thus, when the user at the conference room A
that has been designated to NS mode performs a
switching operation that displays an image of the
conference room X, the control unit 1106' rewrites
the displaying conference room management table
so that only the column of the conference room A
represents the conference room X instead of the
conference room D, as shown in Fig. 34 (b).
When the user at the conference room D that has
been designated to S mode performs a switching
operation that displays an image of the conference
room A, the control unit 1106' rewrites the
displaying conference room table so that the contents
as shown in Fig. 28 are changed to those as shown
in Fig. 35 (b). In other words, the control unit 1106'
rewrites the displaying conference room table so
that the column of each conference room that has
been designated to PS or S mode, among the
conference rooms which are in the NS/PS
designating mode, represents the conference room X
that is designated to NS mode. In these cases, the
control unit 1106' rewrites the matrix switcher
status table as shown in Figs. 34 (c) and 35 (c).
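
The handling of the displaying conference room switching event (Fig. 32) can be sketched as follows. The function name and table layout are hypothetical. Two guards are assumptions added for safety and are not described in the text: a request to display one's own room is ignored, and the NS/PS management table is left unchanged when the requested room already holds the role it would be given.

```python
from typing import Dict, Optional, Sequence


def on_display_switch(
    operator: str,
    target: str,
    rooms: Sequence[str],
    mode: Dict[str, str],      # room -> "NS/PS" or "independent"
    nsps: Dict[str, str],      # room -> "NS", "PS" or "S"
    display: Dict[str, str],   # room -> room shown on its monitor
) -> None:
    """Update the NS/PS and displaying tables in place."""
    if target == operator:                        # assumption: ignore
        return

    if mode[operator] == "independent":           # S221
        display[operator] = target                # S222
        return

    def find(state: str) -> Optional[str]:
        return next((r for r in rooms if nsps[r] == state), None)

    if nsps[operator] == "NS":                    # S223
        if nsps[target] != "PS":
            old_ps = find("PS")
            if old_ps is not None:
                nsps[old_ps] = "S"
            nsps[target] = "PS"                   # S224
    elif nsps[target] != "NS":
        old_ps, old_ns = find("PS"), find("NS")
        if old_ps is not None:
            nsps[old_ps] = "S"
        if old_ns is not None:
            nsps[old_ns] = "PS"
        nsps[target] = "NS"                       # S225

    ns_room, ps_room = find("NS"), find("PS")
    for room in rooms:                            # S226, S229, S230
        if mode[room] != "NS/PS":                 # S227
            continue                              # independent: untouched
        if room == ns_room:
            if ps_room is not None:
                display[room] = ps_room           # S228: NS room watches PS
        elif ns_room is not None:
            display[room] = ns_room               # S228: others watch NS


if __name__ == "__main__":
    rooms = ["A", "B", "C", "D", "E", "X"]
    mode = {r: "NS/PS" for r in rooms}
    mode.update(B="independent", C="independent")
    nsps = {"A": "NS", "B": "S", "C": "S", "D": "PS", "E": "S", "X": "S"}
    display = {"A": "D", "B": "C", "C": "D", "D": "A", "E": "A", "X": "A"}
    on_display_switch("A", "X", rooms, mode, nsps, display)
    print(nsps)
    print(display)
```

In every branch the matrix switcher status table would then be rebuilt from the displaying table (step S231).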

Next, with reference to a flow chart shown in
Fig. 36, the operation of the control unit 1106'
corresponding to an occurrence of (3) designated
mode changing event will be described.

When the user at a conference room performs
a designated mode changing operation with the
operating unit 1301 of the television conference
terminal unit 1200', the control unit 1106'
determines whether the operating conference room has
been designated to the independent designating
mode or the NS/PS designating mode with
reference to the mode management table (at step
S241).

When the operating conference room has been 
designated to the independent designating mode, 
the control unit 1106' rewrites the mode management
table so that the mode of the conference
room is changed to the NS/PS designating mode 
(at step S242). In addition, the control unit 1106' 
resets the automatic conference room switching 
flag of the conference room in the mode manage- 
ment table corresponding to the mode switching 
operation (at step S243). Then, the control unit 
1106' rewrites the displaying conference room 
management table so that the column of this con- 
ference room represents a conference room that 
has been designated to NS mode with reference to 
the NS/PS management table (at step S244). 

The control unit 1106' determines whether the 
contents of the displaying conference room man- 
agement table have been changed (at step S245). 
When the contents have been changed, the control 
unit 1106' rewrites the matrix switcher status table
corresponding to the contents of the displaying 
conference room management table (at step S246). 

Thus, when the user at the conference room B 
that has been designated to the independent des- 
ignating mode performs a designated mode chang- 
ing operation, the control unit 1106' rewrites the 
mode management table so that the contents as 
shown in Fig. 26 are changed to the contents 
shown in Fig. 37 (a). Next, the control unit 1106' 
rewrites the displaying conference room manage- 
ment table as shown in Fig. 37 (c) so that the 
column of the conference room B represents the 
conference room A that has been designated to NS 
mode instead of the conference room D with reference
to the NS/PS management table shown in Fig. 37
(b). In addition, the control unit 1106' rewrites the 
matrix switcher status table so that the statuses of 
the switch elements 1157 corresponding to the 
conference room B are changed as shown in Fig. 
37 (d) corresponding to the contents of the displaying
conference room management table. When the
monitor at the conference room B already displays
an image of the conference room A that has been
designated to NS mode, the image displayed on the
monitor does not change, so the control unit 1106'
rewrites neither the displaying conference room
management table nor the matrix switcher status
table.

When the operating conference room has been 
designated to the NS/PS designating mode (at step 
S241), the control unit 1106' rewrites the mode 
management table so that the designated mode of 
the conference room is changed to the indepen- 
dent designating mode (at step S247) and sets the 
automatic conference room switching flag to "en- 
able" (at step S248). 


Thus, when the user at the conference room D
performs a designated mode changing operation,
the control unit 1106' rewrites the mode management
table so that the designated mode of the
conference room D and the automatic conference
room switching flag are changed as shown in
Fig. 38.
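
The branch structure of steps S241 to S248 can be sketched as follows. This is a minimal illustration only: the tables are modeled as plain Python dictionaries, the matrix switcher status table is reduced to a mapping from a viewing room to its source room, and the names `ModeEntry`, `on_mode_change`, and `ns_room` are hypothetical and do not appear in the specification.

```python
from dataclasses import dataclass

INDEPENDENT = "independent"
NS_PS = "NS/PS"


@dataclass
class ModeEntry:
    mode: str          # INDEPENDENT or NS_PS
    auto_switch: bool  # automatic conference room switching flag


def on_mode_change(room, mode_table, display_table, switcher_table, ns_room):
    """Handle a designated mode changing operation issued from `room`."""
    entry = mode_table[room]
    if entry.mode == INDEPENDENT:            # S241
        entry.mode = NS_PS                   # S242
        entry.auto_switch = False            # S243: reset the flag
        previous = display_table[room]
        display_table[room] = ns_room        # S244: show the NS-mode room
        if display_table[room] != previous:  # S245
            switcher_table[room] = ns_room   # S246 (simplified switcher model)
    else:
        entry.mode = INDEPENDENT             # S247
        entry.auto_switch = True             # S248: set the flag to "enable"


# Room B leaves the independent mode; room D leaves the NS/PS mode.
mode_table = {"B": ModeEntry(INDEPENDENT, True), "D": ModeEntry(NS_PS, False)}
display_table = {"B": "D", "D": "A"}
switcher_table = dict(display_table)

on_mode_change("B", mode_table, display_table, switcher_table, ns_room="A")
on_mode_change("D", mode_table, display_table, switcher_table, ns_room="A")
```

In this sketch the tables for room B are rewritten only when the displayed room actually changes, which mirrors the check at step S245.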

Next, with reference to a flow chart shown in
Fig. 39, the operation of the control unit 1106'
corresponding to an occurrence of (4) automatic
conference room switching enable/disable
designating event will be described.

With the operating unit 1301 of the television
conference terminal unit 1200' at a particular
conference room, when the user performs an automatic
conference room switching enable/disable
operation, the control unit 1106' determines whether
the operating conference room has been
designated to the independent designating mode or
the NS/PS designating mode with reference to the
mode management table (at step S251).

When the operating conference room has been
designated to the independent designating mode,
the control unit 1106' rewrites the mode management
table so that the automatic conference room
switching enable/disable designating flag of the
conference room represents the reverse status. In
other words, if the flag has been set to "enable",
the control unit 1106' changes the status to
"disable", whereas if the flag has been set to
"disable", the control unit 1106' changes the status to
"enable" (at step S252).

When the operating conference room has been
designated to the NS/PS designating mode (at
step S251), since the automatic conference room
switching enable/disable operation is invalid, the
control unit 1106' immediately terminates the
process.

Thus, when the user at a conference room
performs the automatic conference room switching
enable/disable operation with the operating unit
1301 of the television conference terminal unit
1200', the user can select a mode for automatically
displaying an image of a speaking conference
room when it is detected.
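
As an illustration of steps S251 and S252, the following sketch toggles the flag only for a room in the independent designating mode. The dictionary layout of the mode management table and the function name `on_auto_switch_toggle` are assumptions made for this example.

```python
def on_auto_switch_toggle(room, mode_table):
    """Toggle the automatic conference room switching flag of `room`."""
    entry = mode_table[room]
    if entry["mode"] != "independent":               # S251
        return False                                 # operation invalid: terminate
    entry["auto_switch"] = not entry["auto_switch"]  # S252: reverse the status
    return True


mode_table = {
    "B": {"mode": "independent", "auto_switch": True},
    "C": {"mode": "NS/PS", "auto_switch": False},
}
toggled_b = on_auto_switch_toggle("B", mode_table)  # flag of B becomes "disable"
toggled_c = on_auto_switch_toggle("C", mode_table)  # ignored for an NS/PS room
```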

Next, with reference to a flow chart shown in
Fig. 40, the operation of the control unit 1106'
corresponding to an occurrence of (5) remote
location camera controlling event will be described.

When the user at a conference room performs
a remote location camera controlling operation
such as rotating a camera or zooming an image
thereof with the operating unit 1301 of the
television conference terminal unit 1200', the control
unit 1106' determines a conference room displayed
on the monitor at the operating conference room
with reference to the displaying conference room
management table (at step S261).

The control unit 1106' sends camera control
data to the television conference terminal unit
1200' at the conference room displayed on the
monitor 1207 of the television conference terminal
unit 1200' at the operating conference room (at
step S262).

Thus, the user at any conference room can 
remotely control a camera 1206 disposed at an- 
other conference room displayed on the monitor 
1207 of the television conference terminal unit 
1200' at the user's conference room so as to 
display an image of a desired conference room. 
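
Steps S261 and S262 amount to a table lookup followed by a forwarded command. The sketch below assumes a hypothetical `send` callback standing in for the transmission of camera control data, and a command represented as a small dictionary; neither is defined in the specification.

```python
def on_camera_control(operating_room, command, display_table, send):
    """Forward a camera command to the room shown at `operating_room`."""
    target_room = display_table[operating_room]  # S261: which room is displayed?
    send(target_room, command)                   # S262: send camera control data
    return target_room


sent = []
display_table = {"A": "C", "B": "A"}  # viewing room -> displayed room
on_camera_control("A", {"zoom": 2}, display_table,
                  lambda room, cmd: sent.append((room, cmd)))
```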

A camera 1206 at a conference room that has 
been designated to the independent designating 
mode may not be controlled. In addition, the re- 
mote location camera control operation may be 
valid for a conference room that has been des- 
ignated to the independent designating mode. 

Last, with reference to a flow chart shown in
Fig. 41, the operation of the control unit 1106'
corresponding to an occurrence of (6) designated
mode forced changing event will be described.

When the user at a conference room performs
a designated mode forced changing operation (this
conference room is referred to as the operating
conference room) with an operating unit 1301 of a
television conference terminal unit 1200', the
control unit 1106' determines whether or not the
operating conference room has been designated to a
chairman conference room (at step S271).

When the operating conference room has been 
designated to a chairman conference room, the 
designated mode to be changed is either the 
NS/PS designating mode or the independent des- 
ignating mode (at step S272). 

When the operating conference room has been 
designated to the NS/PS designating mode, the 
control unit 1106' changes the modes of all other 
conference rooms in the mode management table 
to the NS/PS designating mode (at step S273). In 
addition, the control unit 1106' resets all the auto- 
matic conference room switching flags in the mode 
management table (at step S274).

On the other hand, when the designated mode
to be changed is the independent designating
mode, the control unit 1106' changes the modes of
all other conference rooms in the mode management
table to the independent designating mode
(at step S275). In addition, the control unit 1106'
sets the automatic conference room switching flags
of all other conference rooms in the mode management
table to "enable" (at step S276).

When the operating conference room has not
been designated to a chairman conference room
(at step S271), since this operation is invalid for
other than the chairman conference room, the
control unit 1106' immediately terminates this process.


Thus, the user at a chairman conference room 
can forcedly change the modes of all other con- 
ference rooms to a desired mode. 
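
The forced mode change of steps S271 to S276 can be sketched as below. The sketch assumes that the branch at step S272 is taken on the mode requested by the chairman conference room; the function name `on_forced_mode_change` and the dictionary layout of the mode management table are likewise assumptions.

```python
def on_forced_mode_change(operating_room, requested_mode, mode_table, chairman_room):
    """Apply a designated mode forced changing operation."""
    if operating_room != chairman_room:                    # S271
        return False                                       # invalid: terminate
    others = [room for room in mode_table if room != operating_room]
    if requested_mode == "NS/PS":                          # S272
        for room in others:
            mode_table[room]["mode"] = "NS/PS"             # S273
        for entry in mode_table.values():
            entry["auto_switch"] = False                   # S274: reset all flags
    else:
        for room in others:
            mode_table[room]["mode"] = "independent"       # S275
            mode_table[room]["auto_switch"] = True         # S276: set to "enable"
    return True


mode_table = {
    "A": {"mode": "NS/PS", "auto_switch": False},
    "B": {"mode": "independent", "auto_switch": True},
    "C": {"mode": "independent", "auto_switch": False},
}
rejected = on_forced_mode_change("B", "NS/PS", mode_table, chairman_room="A")
accepted = on_forced_mode_change("A", "NS/PS", mode_table, chairman_room="A")
```

A request from a room other than the chairman conference room leaves the table untouched, which corresponds to the immediate termination described above.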

The chairman conference room can be i) designated
by the multi-location control apparatus 1100',
ii) the conference room whose television conference
terminal unit 1200' was connected to the
multi-location control apparatus 1100' first, or
the like.

According to this embodiment, when a television
conference terminal unit is designated to the
independent designating mode and the automatic
conference room switching disable mode, images
displayed on the monitor thereof are not switched
by an operation at another conference room or
when a speaking conference room is detected. In
addition, at a chairman conference room, modes of
all other conference rooms can be forcedly
changed. Thus, the multi-location television
conference system according to this embodiment can
manage a multi-location conference in a manner
similar to a real conference.

Moreover, even in the independent designating
mode, an automatic switching mode for an image
of a speaking conference room can be selected.
Thus, when an important speech is made at another
conference room, this speech can be heard
at a conference room that has been designated to
the independent designating mode.

Fig. 42 is a block diagram showing an overall
construction of a multi-location television
conference system according to a third embodiment of
the present invention.

A multi-location control apparatus 3100 is
connected to television conference terminal units
3200A, 3200B, ... , and 3200Z at locations A, B, ... ,
and Z through a dedicated line, subscriber line
(such as ISDN), or the like.

The multi-location control apparatus 3100
switches and controls image data of all the
conference rooms received from the television
conference terminal units 3200A, 3200B, ... , and
3200Z and distributes image data received from a
particular television conference terminal unit to
television conference terminal units 3200 at other
conference rooms.

In this embodiment, when the multi-location
control apparatus 3100 distributes image data of a
particular conference room to other conference
rooms, the apparatus 3100 sends information with
respect to the location name of the transmitting
conference room as an image or code that is
multiplexed with the image data of the transmitting
conference room. Thus, as shown in Fig. 43, a
monitor of each of the television conference
terminal units 3200 displays a superimposed image of
the transmitting conference room and the
information (location name thereof).

Fig. 44 is a block diagram showing an example
of a construction of a first system that
accomplishes such a display system.

An image transmitting system of the television 
conference terminal unit 3200 comprises a camera 
3210, a location name image generating unit 3220, 
an image superimposing unit 3230, and an image 
encoding unit 3240. 

The camera 3210 photographs an image of a 
conference room and outputs an image signal (for 
example, NTSC signal or PAL signal) to the image 
superimposing unit 3230. The location name image 
generating unit 3220 generates an image of a loca- 
tion name of the conference room and outputs the 
image signal to the image superimposing unit 
3230. 

The image superimposing unit 3230 superim- 
poses the location name image signal, which has 
been received from the location name image gen- 
erating unit 3220 over a predetermined part of the 
image signal of the conference room, which has 
been received from the camera 3210, and outputs 
the superimposed signal, which is an image signal, 
to the image encoding unit 3240. 

The image encoding unit 3240 encodes the
superimposed image signal to digital data. The
digital data is sent to the multi-location control
apparatus 3100 through a network interface (not
shown). The image encoding unit 3240 encodes
the image signal (NTSC signal, PAL signal, or the
like) according to a predetermined encoding
technique, for example CCITT Recommendation
H.261.

The multi-location control apparatus 3100
switches and controls digital encoded data of im- 
ages received from the television conference termi- 
nal units 3200 at conference rooms and distributes 
the selected data to the television conference ter- 
minal units 3200 at other conference rooms. 

An image receiving system of the television 
conference terminal unit 3200 comprises an image 
decoding unit 3260 and a receiving monitor 3270.

The image decoding unit 3260 decodes digital 
encoded data received from the multi-location con- 
trol apparatus 3100 to an original image signal 
(NTSC signal, PAL signal, or the like) and outputs 
the decoded signal to the receiving monitor 3270. 

The receiving monitor 3270 displays an image 
where the location name has been superimposed 
over the image of the conference room. 
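
The superimposing step of the first system can be illustrated with a toy raster model. In the sketch below an image is a list of pixel rows, and the "predetermined part" is given by hypothetical `top` and `left` offsets; real NTSC or PAL signal handling is outside the scope of this example.

```python
def superimpose(room_image, name_image, top, left):
    """Overlay `name_image` on a copy of `room_image` at (top, left)."""
    out = [row[:] for row in room_image]      # leave the camera image untouched
    for r, name_row in enumerate(name_image):
        for c, pixel in enumerate(name_row):
            out[top + r][left + c] = pixel    # location name replaces the pixel
    return out


room = [[0] * 4 for _ in range(3)]  # 3 x 4 image from the camera
label = [[9, 9]]                    # 1 x 2 location name image
composite = superimpose(room, label, top=2, left=1)
```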

Fig. 45 is a block diagram showing an example 
of a construction of a second system according to 
the third embodiment of the present invention. 

An image transmitting system of a television
conference terminal unit 3200 of this system
comprises a location name code generating unit 3310,
a signal multiplexing unit 3320, and a line
processing unit 3330 in addition to the above-described
camera 3210 and image encoding unit 3240.

The camera 3210 photographs an image of the
conference room and outputs the resultant image
signal to the image encoding unit 3240. The image
encoding unit 3240 encodes the image signal to a
digital encoded signal and outputs the encoded
signal to the signal multiplexing unit 3320.

The location name code generating unit 3310
generates a location name code (character code) of
the conference room and outputs the location name
code to the signal multiplexing unit 3320.

The signal multiplexing unit 3320 multiplexes
the encoded signal of the image of the conference
room, which has been input from the image
encoding unit 3240, with the location name code, which
has been received from the location name code
generating unit 3310, and outputs the multiplexed
signal to the line processing unit 3330.

The line processing unit 3330 transmits the
multiplexed data to the multi-location control
apparatus 3100.

The multi-location control apparatus 3100
receives the multiplexed data from television
conference terminal units 3200 at conference rooms
and distributes the multiplexed data to the
television conference terminal units 3200 at the
conference rooms.

An image receiving system of the television
conference terminal unit 3200 comprises a signal
demultiplexing unit 3360 and a location name
image generating unit 3370 in addition to the
above-described image decoding unit 3260, image
superimposing unit 3380, and receiving monitor 3270.

The signal demultiplexing unit 3360
demultiplexes the multiplexed data into digital encoded
data of an image of a conference room and a
location name code. The digital encoded data and
the location name code are output to the image
decoding unit 3260 and the location name image
generating unit 3370, respectively.

The location name image generating unit 3370
generates a location name image signal
corresponding to the received location name code and
outputs the image signal to the image
superimposing unit 3380.

The image superimposing unit 3380
superimposes the location name image signal over the
image signal of the conference room received from
the image decoding unit 3260 and outputs the
superimposed signal, which is an image signal, to
the receiving monitor 3270.

The receiving monitor 3270 displays the
superimposed image, where the location name of the
conference room has been superimposed over the
image of the conference room.
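
The specification does not define how the location name code is multiplexed with the encoded image data. As one possible sketch, the example below assumes a simple length-prefixed framing (a 2-byte big-endian length, the UTF-8 location name code, then the encoded image bytes). The framing and the function names `multiplex` and `demultiplex` are illustrative assumptions, not the method of the second system.

```python
import struct


def multiplex(encoded_image: bytes, location_name_code: str) -> bytes:
    """Prefix the encoded image data with a length-tagged location name code."""
    code = location_name_code.encode("utf-8")
    return struct.pack(">H", len(code)) + code + encoded_image


def demultiplex(multiplexed: bytes):
    """Split multiplexed data back into (encoded image data, location name code)."""
    (length,) = struct.unpack_from(">H", multiplexed, 0)
    code = multiplexed[2:2 + length].decode("utf-8")
    return multiplexed[2 + length:], code


frame = multiplex(b"\x00\x01\x02", "Tokyo")  # transmitting side (unit 3320)
encoded_image, name = demultiplex(frame)     # receiving side (unit 3360)
```

Because the control apparatus only relays the multiplexed data in this system, the same pair of functions covers both ends of the path.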

Fig. 46 is a block diagram showing an example
of a construction of a third system according to the
third embodiment.

An image transmitting system of the television
conference terminal unit 3200 of each conference
room multiplexes a location number designated to
the conference room with encoded data of an image
signal of the conference room and transmits
the multiplexed signal to the multi-location control
apparatus 3100. In other words, the image
transmitting system of the third system has a location
number generating unit (not shown) that generates
a location number instead of the location name
code generating unit 3310 of the second system.

The multi-location control apparatus 3100
comprises a database unit 3110, a demultiplexing unit
3120, a location name code retrieving unit 3130,
and a multiplexing and distributing unit 3140. The
database unit 3110 stores location name codes
corresponding to location numbers 1, 2, ... , and n
at the conference rooms.

The demultiplexing unit 3120 demultiplexes the
multiplexed data, where the image data of the
conference room has been multiplexed with the
location number, into image signal encoded data
and a location number. The location number and
the image signal encoded data are output to the
location name code retrieving unit 3130 and the
multiplexing and distributing unit 3140, respectively.

The location name code retrieving unit 3130 
retrieves a location name code corresponding to 
the location number from the database unit 3110 
and outputs the location name code to the mul- 
tiplexing and distributing unit 3140. 

The multiplexing and distributing unit 3140 
multiplexes the location name code with the image 
signal encoded data of the conference room and 
distributes the multiplexed signal to the television 
conference terminal units 3200 at the conference 
rooms. 

An image receiving system of the television 
conference terminal unit 3200 has a construction 
similar to that of the second system. As with the 
second system, a receiving monitor 3270 displays 
a superimposed image, where the location name 
has been superimposed over the image of the 
conference room. 
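
The role of the database unit 3110 in the third system can be sketched as a lookup from location number to location name code before redistribution. The dictionary contents, the tuple used to stand for the multiplexed signal, and the function name `relay` are hypothetical.

```python
def relay(location_number, encoded_image, database, destinations):
    """Replace a location number with its name code and distribute the result."""
    name_code = database[location_number]  # location name code retrieving unit 3130
    packet = (name_code, encoded_image)    # stands in for the multiplexed signal
    return {dest: packet for dest in destinations}  # distributing unit 3140


database = {1: "Tokyo", 2: "Osaka", 3: "Nagoya"}  # stands in for database unit 3110
out = relay(2, b"\x10\x20", database, ["A", "C"])
```

One consequence of this arrangement is that a location name can be changed in a single place, at the control apparatus, without touching the terminal units.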

Fig. 47 is a block diagram showing an example 
of a construction of a fourth system according to 
the third embodiment of the present invention. 

The construction of an image transmitting sys- 
tem of a television conference terminal unit 3200 of 
the fourth system is similar to that of the above- 
described third system. 

The multi-location control apparatus 3100 of
the fourth system comprises an image decoding
unit 3150, a location name image generating unit
3160, an image superimposing unit 3170, and an
encoding and distributing unit 3180 in addition to
the above-described database unit 3110,
demultiplexing unit 3120, and location name code
retrieving unit 3130, as with the construction of the
third system.

The image decoding unit 3150 decodes
encoded data received from the demultiplexing unit
3120 to an image signal of the conference room,
which has been switched and selected, and outputs
the decoded signal to the image superimposing
unit 3170.

The location name image generating unit 3160
generates an image signal of the location name
corresponding to the location name code received
from the location name code retrieving unit 3130
and outputs the image signal to the image
superimposing unit 3170.

The image superimposing unit 3170
superimposes the image signal of the location name over a
predetermined part of the image signal of the
conference room. In addition, the image superimposing
unit 3170 outputs the superimposed image signal
to the encoding and distributing unit 3180.

The encoding and distributing unit 3180
encodes the superimposed image signal according
to an encoding technique, for example CCITT
Recommendation H.261, and distributes the
encoded signal to the television conference terminal
units 3200 at the conference rooms.

An image receiving system of the television
conference terminal unit 3200 at each conference
room has a construction similar to that of the first
system. The image receiving system comprises
the image decoding unit 3260 and the receiving
monitor 3270. The image decoding unit 3260
decodes the encoded data of the image signal
received from the multi-location control apparatus
3100 into the image of the conference room and
the location name thereof, which have been
switched and selected. The monitor 3270 displays
the superimposed image, where the location name
of the conference room has been superimposed
over the image of the conference room.

In addition, the present invention may be
applied to the following system.

(1) The multi-location control apparatus receives
encoded image data of a conference room and
a location code thereof from each television
conference terminal unit. The multi-location
control apparatus decodes the encoded image data
of the conference room to an image signal and
generates an image signal of a location name
corresponding to the location code. Thereafter,
the multi-location control apparatus generates an
image signal where the image signal of the
location name has been superimposed over the
image signal of the conference room, encodes
the superimposed image signal, and distributes
the encoded signal to each television
conference terminal unit.

Each television conference terminal unit
decodes the encoded data to an image signal and
displays an image where the location name has
been superimposed over the image of the
conference room on the monitor.

(2) The television conference terminal unit
multiplexes the encoded image data of the
conference room and the location number and
transmits the multiplexed signal to the
multi-location control apparatus.

The multi-location control apparatus distributes
the multiplexed data to the television conference
terminal unit as it is.

The television conference terminal unit
demultiplexes the multiplexed data into the encoded
image data of the conference room and the
location number. Next, the television conference
terminal unit decodes the encoded image data of the
conference room to the image signal and generates 
the image signal of the location name correspond- 
ing to the location number. In addition, the televi- 
sion conference terminal unit superimposes the 
image signal of the location name over the image 
signal of the conference room. Thus, a monitor of 
the television conference terminal unit displays an 
image where the location name has been superim- 
posed over the image of the conference room. 

In this embodiment, the image of the location 
name is superimposed over the image of the con- 
ference room. However, the present invention is not 
limited to this method. Instead, background infor- 
mation (such as the name of a speaking attendant 
and his/her title) at a conference room may be 
superimposed over an image of the conference 
room. 

According to the present invention, when 
speeches are detected at a plurality of locations, 
image data that contains images of these speaking 
locations in one screen is generated and distrib- 
uted to each television conference terminal unit. 
Thus, when a discussion is performed at a plurality 
of locations, images of the speaking locations can 
be displayed at a time. In addition, the number of 
images of locations displayed on the screen varies 
as the number of speaking locations varies. Thus, 
images with reality can be displayed. Moreover, 
even if a plurality of speeches are made at the 
same time, since these locations are displayed on 
the same screen, speaking locations can be easily 
identified. 

At each listening location, images of only 
speaking locations are displayed. Thus, at each 
listening location, speaking locations can be imme- 
diately identified. 

At each speaking location, images of other
speaking locations and an image of a former
speaking location are displayed on the same
screen. Thus, at each speaking location, other
speaking locations can be easily identified.
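
The selection rule summarized above, for listening and speaking locations, can be sketched as a small function that returns the locations to be combined on one screen for a given viewer. The function name `screen_layout` and the use of a single former speaking location are assumptions made for this illustration.

```python
def screen_layout(viewer, speakers, former_speaker):
    """Return the locations whose images are combined for `viewer`."""
    if viewer not in speakers:
        return list(speakers)  # listening location: all speaking locations
    layout = [s for s in speakers if s != viewer]  # other speaking locations
    if former_speaker is not None and former_speaker != viewer \
            and former_speaker not in layout:
        layout.append(former_speaker)  # plus the former speaking location
    return layout


speakers = ["A", "B", "C", "D"]  # simultaneous speeches at four locations
at_listener = screen_layout("E", speakers, former_speaker="E")
at_speaker = screen_layout("A", speakers, former_speaker="E")
```

With five locations A to E, the listening location E receives the four speaking locations, while the speaking location A receives B, C, D, and the former speaking location E.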


Since image data switched and selected is
output to each television conference terminal unit
independent from other television conference
terminal units, at each television conference terminal
unit, an image of a desired conference room can
be selected without necessity to switch monitor
images displayed at other television conference
terminal units. In addition, images displayed at
each television conference terminal unit are not
switched by an operation at another conference
room.

At each conference room, a first mode that can
select an image of a desired conference room
independent from other conference rooms or a
second mode that can switch an image of each
conference room corresponding to a detection of a
speaking location, by an operation at another
television conference terminal unit, or the like can be
selected. Thus, the mode where an image
displayed is not switched by an operation at another
conference room, corresponding to a detection of a
speaking location, or the like (first mode) or the
mode where an image displayed is automatically
switched (second mode) can be selected.
Consequently, images can be displayed depending on
the situations.

Even in the first mode, a mode that automatically
displays an image of a conference room
corresponding to a detection of a speaking location
can be selected. Thus, speeches can be securely
heard.

At a conference room that has been designated
to a main conference room, modes of other
conference rooms can be designated when necessary.
Thus, a conference similar to a real conference,
which is managed by a chairman, can be
accomplished.

A camera at a conference room whose image
is displayed can be remotely controlled from
another conference room. Thus, a desired image of
the conference room can be obtained by
controlling the zooming and orientation of the camera.

Since information with respect to an image of a
conference room, which includes for example the
location name of the conference room, the name of
the speaking attendant at the conference room, and
the title of the speaking attendant, can be
displayed, the conference can progress smoothly.

Claims 

1. A multi-location control apparatus for use with
a multi-location television conference system
that has a plurality of multi-location television
conference terminal means (161-i) (where i =
1, 2, ... , and N) disposed at three or more
locations, each of said television conference
terminal means being adapted for transmitting
and receiving sound data and image data, said
multi-location control apparatus being adapted
for receiving sound data and image data from
all of said television conference terminal
means (161-i) and for transmitting sound data
and image data to said television conference
terminal means (161-i), said multi-location
control apparatus comprising:

speaking location detecting means (171)
for detecting a speaking location corresponding
to sound signals that are output from
sound input means of each of said television
conference terminal means (161-i);

image combining means (172) for generating
image data containing images of speaking
locations when a plurality of speaking locations
are detected by said speaking location detecting
means (171); and

image distributing means (173) for transmitting
image data generated by said image
combining means (172) to each of said television
conference terminal means (161-i).

2. The multi-location control apparatus as set
forth in claim 1,

wherein said image combining means
(172) is adapted for generating image data of
speaking locations detected by said speaking
location detecting means (171) for listening
locations where speeches are not detected by
said speaking location detecting means (171),
the image data being contained in one screen,
and

wherein said image distributing means
(173) is adapted for distributing the image data

3. The multi-location control apparatus as set
forth in claim 1,

wherein said image combining means
(172) is adapted for generating image data of
other speaking locations and a former speaking
location for each of the speaking locations,
the image data being contained in one screen,
and

wherein said image distributing means
(173) is adapted for transmitting the image
data to television conference terminal means
(161-i) of the speaking locations.

4. A multi-location control apparatus for use with
a multi-location television conference system
that has a plurality of multi-location television
conference terminal means (200-i) (where i =
1, 2, ... , and N) disposed at three or more
locations, each of said television conference
terminal means being adapted for transmitting
and receiving sound data and image data, said
multi-location control apparatus being adapted
for switching and controlling sound data and
image data transmitted among all of said
television conference terminal means (200-i), said
multi-location control apparatus comprising:

switching and outputting means (231) for
inputting image data of all of said conference
rooms from all of said television conference
terminal means (200-1, 200-2, ... , and 200-N)
and for switching and outputting image data of
any conference room to each of said television
conference terminal means (200-i) independent
of other television conference terminal means
(200-j) (where j ≠ i, j = 1, 2, ... , and N); and

control means (232) for controlling said
switching and outputting means (231) so that
when a switching request for displaying an
image of a desired conference room is issued
from a particular television conference terminal
means (200-i), image data to be transmitted to
this particular television conference terminal
means (200-i) is switched to the desired image
data by the particular television conference
terminal means (200-i).

5. A multi-location control apparatus for use with 
a multi-location television conference system 
that has a plurality of mufti-location television 
conference terminal means (400-i) (where i = 
1, 2, ... , and N) disposed at three or more 
locations, each of said television conference 
terminal means being adapted for transmitting 
and receiving sound data and image data, said 
multi-location control apparatus being adapted 
for switching and controlling sound data and 
image data transmitted among all of said tele- 
vision conference terminal means (400-i), said 
multi-location control apparatus comprising: 

mode designating means having a first 
mode and a second mode, said first mode 
being adapted for selecting image data of a 
conference room displayed at a television con- 
ference terminal means (400-i) independent of 
other television conference terminal means 
(400-j) (where j * i, j = 1,2, ... and , N), said 
second mode being adapted for forcedly se- 
lecting image data of a conference room dis- 
played at a television conference terminal 
means (400-j) corresponding to an occurrence 
of a predetermined event of another television 
conference terminal means (400-i); 

mode designating means (351) for des- 
ignating either said first mode or said second 
mode to each of said conference rooms cor- 
responding to a mode designating request re- 
ceived from a television conference terminal 
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means (400-i) of each of said conference 
rooms; 

switching and outputting means (352) for 
inputting image data of all of said conference 
rooms from all of said television conference 
terminal means (340-1, 340-2, ..., and 340-N) 
and for switching and outputting image data of 
any conference room to each of said television 
conference terminal means (400-i) independent 
of other television conference terminal means 
(400-j) (where j ≠ i, j = 1, 2, ..., and N); 

first control means (353) for controlling 
said switching and outputting means (352) so 
that when a switching request for displaying an 
image of a desired conference room is issued 
from a particular television conference terminal 
means (340-i) that has been designated to said 
first mode, image data to be transmitted to this 
particular television conference terminal means 
(340-i) is switched to the desired image data 
by the particular television conference terminal 
means (340-i); and 

second control means (354) for controlling 
said switching and outputting means (352) so 
that when said event takes place in a particular 
television conference terminal means (340-i), 
image data to be transmitted to television con- 
ference terminal means (340-k) (where k = 1, 
2, ..., and N) that have been designated to said 
second mode is switched to image data of the 
television conference terminal means (340-i). 
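
By way of non-limiting illustration only, the mode-dependent switching recited in claim 5 might be sketched as follows. The class and method names, the default of the first mode for every room, and the exclusion of the originating room from forced switching are assumptions made for this sketch and are not recited in the claim.

```python
from enum import Enum


class Mode(Enum):
    FIRST = 1   # the room selects its own displayed image
    SECOND = 2  # the displayed image is forced by an event at another room


class MultiLocationController:
    """Hypothetical stand-in for means (351) to (354)."""

    def __init__(self, rooms):
        # Assumption: every room starts in the first mode, showing nothing.
        self.mode = {room: Mode.FIRST for room in rooms}
        self.displayed = {room: None for room in rooms}

    def designate_mode(self, room, mode):
        # Mode designation on request from the room's own terminal.
        self.mode[room] = mode

    def request_switch(self, room, desired):
        # First control: honoured only for a room designated to the first mode.
        if self.mode[room] is Mode.FIRST and desired in self.displayed:
            self.displayed[room] = desired
            return True
        return False

    def on_event(self, source):
        # Second control: every second-mode room is switched to the source.
        # Assumption: the source room itself is left unchanged.
        switched = []
        for room, mode in self.mode.items():
            if mode is Mode.SECOND and room != source:
                self.displayed[room] = source
                switched.append(room)
        return switched
```

In this sketch a room designated to the first mode is unaffected by `on_event`, and a switching request from a room designated to the second mode is refused.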

6. The multi-location control apparatus as set 
forth in claim 5, 

wherein said event is a detection of a 
speaking conference room, 

wherein said mode designating means 
(351) is adapted for designating an automatic 
switching enable mode to a conference room 
when receiving an automatic switching request 
from a television conference terminal means 
(340-i) of the conference room that has been 
designated to said second mode, 

wherein said second control means (354) 
is adapted for controlling said switching and 
outputting means (352) so that when a speech 
is detected by a television conference terminal 
means (340-i) at a particular conference room, 
image data to be transmitted to television con- 
ference terminal means (340-j) (where j = 1, 2, 
..., and N) of conference rooms that have been 
designated to said second mode and said 
automatic switching enable mode is switched 
to image data received from the television con- 
ference terminal means (340-i). 

7. The multi-location control apparatus as set 
forth in claim 5 or 6, further comprising: 


main conference room designating means 
(355) for selecting and designating a main con- 
ference room from all of said conference 
rooms; and 

mode changing means (356) for forcedly 
changing modes of all of said conference 
rooms managed by said mode designating 
means (351) to either said first mode or said 
second mode designated by a television con- 
ference terminal means (340-i) at a particular 
conference room when the particular confer- 
ence room is said main conference room. 

8. The multi-location control apparatus as set 
forth in claim 5, 6, or 7, further comprising: 

displaying conference room managing 
means (357) for receiving switching information 
from said first and second control means (353) 
and (354) so as to store conference rooms 
displayed on television conference terminal 
means (340-i) at said conference rooms; 

transferring means (358) for seeking a 
conference room whose image is being dis- 
played on a television conference terminal 
means (340-i) at a particular conference room 
with reference to stored information in said 
displaying conference room managing means 
(357) and for transferring said camera control 
information to the television conference termi- 
nal means (340-j) when receiving camera con- 
trol information from the television conference 
terminal means (340-i) at the particular con- 
ference room. 
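
By way of non-limiting illustration only, the cooperation of the displaying conference room managing means (357) and the transferring means (358) of claim 8 might be sketched as follows. The names `DisplayManager`, `transfer_camera_control`, and the `send` callback are hypothetical, as is the choice to drop a command when no displayed room has been stored.

```python
class DisplayManager:
    """Hypothetical stand-in for means (357): remembers what each room displays."""

    def __init__(self):
        self._shown = {}

    def record_switch(self, viewer, shown):
        # Called with switching information from the control means.
        self._shown[viewer] = shown

    def shown_at(self, viewer):
        return self._shown.get(viewer)


def transfer_camera_control(manager, sender, command, send):
    """Hypothetical stand-in for means (358).

    Looks up the room whose image is displayed at `sender` and forwards the
    camera control information to that room's terminal via `send`.
    """
    target = manager.shown_at(sender)
    if target is None:
        # Assumption: with no stored entry the command is simply dropped.
        return None
    send(target, command)
    return target
```

The lookup is what lets a participant steer the remote camera of whichever room is currently on screen without naming that room explicitly.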

9. A television conference terminal apparatus, 
comprising: 

image superimposing means (461) for 
generating an image signal where an image of 
background information is superimposed over 
an image of a conference room photographed 
by a camera; and 

encoding means (462) for encoding an im- 
age signal generated by said image superim- 
posing means (461). 

10. A television conference terminal means, com- 
prising: 

decoding means (565) for decoding an en- 
coded image signal to an original image signal; 
and 

image displaying means (566) for display- 
ing an image corresponding to an image signal 
decoded by said decoding means (565). 

11. A television conference terminal means, com- 
prising: 

encoding means (671) for encoding an im- 
age signal of a conference room photographed 
by a camera; and 

multiplexing means (672) for multiplexing 
data encoded by said encoding means (671) 
and background information code of the image 
of the conference room photographed by said 
camera. 

12. A television conference terminal means, com- 
prising: 

demultiplexing means (775) for demul- 
tiplexing multiplexed data of encoded image 
data of a conference room and background 
information code thereof into said encoded 
data and said code; 

image decoding means (776) for decoding 
the encoded data of the image signal of the 
conference room demultiplexed by said demul- 
tiplexing means (775) to an image signal; 

image generating means (777) for generat- 
ing an image signal corresponding to the back- 
ground information code demultiplexed by said 
demultiplexing means (775); and 

image superimposing means (778) for 
superimposing the image signal generated by 
said image generating means (777) over the 
image signal decoded by said image decoding 
means (776). 
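
By way of non-limiting illustration only, the multiplexing of claim 11 and the demultiplexing, decoding, generating, and superimposing of claim 12 might be sketched as follows. The one-byte code prefix, the `BACKGROUND_TEXT` table, the byte-per-pixel stand-in decoder, and the dictionary used to represent a superimposed image are all assumptions made for this sketch; the claims do not specify a frame format or codec.

```python
# Hypothetical table mapping a background information code to its content.
BACKGROUND_TEXT = {1: "Tokyo", 2: "Osaka"}


def multiplex(encoded_image, code):
    # Assumed framing: a single code byte followed by the encoded image data.
    if not 0 <= code <= 255:
        raise ValueError("code must fit in one byte")
    return bytes([code]) + encoded_image


def demultiplex(frame):
    # Splits a frame back into (encoded image data, background code).
    if not frame:
        raise ValueError("empty frame")
    return frame[1:], frame[0]


def decode_image(encoded):
    # Stand-in decoder: each byte is treated as one pixel value.
    return list(encoded)


def generate_background(code):
    # Produces the background information for a code; unknown codes give "".
    return BACKGROUND_TEXT.get(code, "")


def superimpose(image, label):
    # Represents the overlay of the label on the decoded image.
    return {"pixels": image, "overlay": label}


def receive(frame):
    # Receiving side of claim 12: demultiplex, decode, generate, superimpose.
    encoded, code = demultiplex(frame)
    return superimpose(decode_image(encoded), generate_background(code))
```

Sending only a short code rather than a rendered overlay leaves the image data untouched in transit, so the receiving terminal decides how the background information is drawn.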

13. A multi-location control apparatus for use with 
a multi-location television conference system 
that has a plurality of multi-location television 
conference terminal means disposed at three 
or more locations, each of said television con- 
ference terminal means being adapted for 
transmitting and receiving sound data and im- 
age data, said multi-location control apparatus 
being adapted for switching and controlling 
sound data and image data transmitted among 
all of said television conference terminal 
means, said multi-location control apparatus 
comprising: 

code generating means (881) for generat- 
ing a background information code corre- 
sponding to a background information number 
according to an image of a conference room 
that has been received from a television con- 
ference terminal means of the conference 
room that has been switched and selected; 
and 

multiplexing and distributing means (882) 
for multiplexing the encoded data of the image 
signal of the conference room received from 
the television conference terminal means of 
the conference room that has been switched 
and selected and the code generated by said 
code generating means (881) and for distribut- 
ing the multiplexed signal to all the television 
conference terminal means. 


14. A television conference terminal means, com- 
prising: 

signal demultiplexing means (985) for de- 
multiplexing multiplexed signal of encoded im- 
age data of a conference room and back- 
ground information code thereof received from 
a multi-location control apparatus into said data 
and said code; 

image decoding means (986) for decoding 
the encoded data demultiplexed by said signal 
demultiplexing means (985) into the image sig- 
nal of the conference room; 

image generating means (987) for generat- 
ing the image signal of the background in- 
formation corresponding to the code demul- 
tiplexed by said signal demultiplexing means 
(985); and 

image superimposing means (988) for 
superimposing the image signal generated by 
said background information image generating 
means (987) over the image signal decoded by 
said image decoding means (986). 

15. A multi-location control apparatus for use with 
a multi-location television conference system 
that has a plurality of multi-location television 
conference terminal means disposed at three 
or more locations, each of said television con- 
ference terminal means being adapted for 
transmitting and receiving sound data and im- 
age data, said multi-location control apparatus 
being adapted for switching and controlling 
sound data and image data transmitted among 
all of said television conference terminal 
means, said multi-location control apparatus 
comprising: 

image generating means (991) for generat- 
ing image signal of a background information 
corresponding to a background information 
number according to an image of a conference 
room and received from a television confer- 
ence terminal means of the conference room 
that has been switched and selected; 

image decoding means (992) for decoding 
the encoded data of the image signal of the 
conference room received from the television 
conference terminal means of the conference 
room that has been switched and selected to 
an image signal; 

image superimposing means (993) for 
superimposing the image signal generated by 
said image generating means (991) over the 
image signal decoded by said image decoding 
means (992); and 

encoding means (994) for encoding the 
image signal generated by said image super- 
imposing means (993). 
16. A television conference terminal means, com- 
prising: 

image decoding means (1095) for decod- 
ing encoded data of an image signal received 
from a multi-location control apparatus into an 
original image signal; and 

image displaying means (1096) for dis- 
playing an image corresponding to the image 
signal decoded by said image decoding 
means (1095). 

17. A multi-location television conference system, 
comprising: 

a plurality of television conference terminal 
means disposed at a plurality of locations and 
adapted for transmitting and receiving sound 
data and image data; and 

a multi-location control apparatus disposed 
among said television conference terminal 
means and adapted for switching and control- 
ling data transmitted thereamong, 

wherein each of said television conference 
terminal means is adapted for transmitting im- 
age data where background information of an 
image of a conference room has been super- 
imposed over the image thereof to said multi- 
location control apparatus and for receiving the 
image data that has been switched and se- 
lected by said multi-location control apparatus 
and displaying the superimposed image data. 

18. A multi-location television conference system, 
comprising: 

a plurality of television conference terminal 
means disposed at a plurality of locations and 
adapted for transmitting and receiving sound 
data and image data; and 

a multi-location control apparatus disposed 
among said television conference terminal 
means and adapted for switching and control- 
ling data transmitted thereamong, 

wherein said multi-location control appara- 
tus is adapted for generating image data where 
background information of an image of a con- 
ference room that has been switched and se- 
lected has been superimposed over the image 
thereof, and 

wherein each of said television conference 
terminal means is adapted for receiving the 
image data from said multi-location control ap- 
paratus and for displaying the superimposed 
image data. 

19. A multi-location television conference system, 
comprising: 

a plurality of television conference terminal 
means disposed at a plurality of locations and 
adapted for transmitting and receiving sound 
data and image data; and 

a multi-location control apparatus disposed 
among said television conference terminal 
means and adapted for switching and control- 
ling data transmitted thereamong, 

wherein each of said television conference 
terminal means is adapted for transmitting mul- 
tiplexed data of an image of a conference 
room and background information thereof to 
said multi-location control apparatus and for 
receiving the multiplexed data that has been 
switched and selected from said multi-location 
control apparatus and for displaying superim- 
posed data of the background information and 
the image of the conference room. 

20. A multi-location television conference system, 
comprising: 

a plurality of television conference terminal 
means disposed at a plurality of locations and 
adapted for transmitting and receiving sound 
data and image data; and 

a multi-location control apparatus disposed 
among said television conference terminal 
means and adapted for switching and control- 
ling data transmitted thereamong, 

wherein said multi-location control appara- 
tus is adapted for multiplexing an image of a 
conference room that has been switched and 
selected and background information of the 
image, and 

wherein each of said television conference 
terminal means is adapted for receiving the 
multiplexed data from said multi-location con- 
trol apparatus and for displaying the superim- 
posed image data of the background informa- 
tion and the image of the conference room. 